Abstract

We propose a new estimator for the integrated covariance of two Itô semimartingales observed at high
frequency. This new estimator, which we call the pre-averaged truncated Hayashi-Yoshida estimator, enables
us to separate the sum of the co-jumps from the total quadratic covariation even when the sampling
schemes of the two processes are nonsynchronous and the observation data are contaminated by noise. It is the
first estimator which can simultaneously handle these three issues, which are fundamental to empirical studies
of high-frequency financial data. We also show the asymptotic mixed normality of this estimator under
mild conditions allowing infinite activity jump processes with finite variation, some dependency between the
sampling times and the observed processes, as well as a kind of endogenous observation error. We examine the
finite sample performance of this estimator using a Monte Carlo study.

Keywords: Hayashi-Yoshida estimator; Integrated covariance; Jumps; Market microstructure noise; Nonsyn- 
chronous observations; Pre-averaging; Threshold estimator. 



1 Introduction 

In recent years there has been considerable development in statistical inference for the quadratic
covariations of semimartingales observed at high frequency. This was mainly motivated by financial applications,
because price processes must follow a semimartingale under the no-arbitrage assumption (see [17] for instance) and
technological developments have made high-frequency data commonly available. In general the quadratic covariation
of two semimartingales consists of two sources: the continuous martingale parts and the co-jumps of the semimartingales.
Recently many authors have indicated that separating these two sources benefits various areas of finance such
as volatility forecasting (Andersen et al. [2]), credit risk management (Cont and Kan [16]), the construction of a
hedging portfolio (Todorov and Bollerslev [42]) and so on. For these reasons, in this paper we focus on
disentangling these two components of the quadratic covariation of two semimartingales by using high-frequency
observation data.

Let $Z^1$ and $Z^2$ be two Itô semimartingales and let $(S^i)_{i\in\mathbb{Z}_+}$ be a sequence of stopping times that is
increasing a.s., $S^i \uparrow \infty$, and $S^0 = 0$. Then it is well known in classical stochastic calculus that

$$\sum_{i:\,S^i\le t}\bigl(Z^1_{S^i}-Z^1_{S^{i-1}}\bigr)\bigl(Z^2_{S^i}-Z^2_{S^{i-1}}\bigr)\to^p [Z^1,Z^2]_t \qquad (1.1)$$

for any $t > 0$, provided $\sup_{i\in\mathbb{N}}(S^i\wedge t - S^{i-1}\wedge t)\to^p 0$. Therefore, if we observe $Z^1$ and $Z^2$ at the
time $S^i$ for every $i$, we can use the statistic on the left-hand side of equation (1.1) (which is called the realized
covariance) as a consistent estimator of the quadratic covariation $[Z^1,Z^2]_t$ of $Z^1$ and $Z^2$. Since

$$[Z^1,Z^2]_t = \langle Z^{1,c},Z^{2,c}\rangle_t + \sum_{0\le s\le t}\Delta Z^1_s\,\Delta Z^2_s, \qquad (1.2)$$

our aim will be achieved by constructing an estimator for the quantity $\langle Z^{1,c},Z^{2,c}\rangle_t$, which we call the integrated
covariance of $Z^1$ and $Z^2$. In the present situation we have the observation data $(Z^1_{S^i}+Z^2_{S^i})_{i\in\mathbb{Z}_+}$ and
$(Z^1_{S^i}-Z^2_{S^i})_{i\in\mathbb{Z}_+}$, so that the problem reduces to the univariate case due to the polarization identity
$\langle Z^{1,c},Z^{2,c}\rangle = (\langle Z^{1,c}+Z^{2,c}\rangle - \langle Z^{1,c}-Z^{2,c}\rangle)/4$. As a consequence, we can benefit from the vast
number of studies on detecting jumps in univariate high-frequency financial data. For example, Barndorff-Nielsen and
Shephard [6] used such a method based on the bipower technique introduced in [7] and proposed an estimator called
the realized bipower covariation. On the other hand, there are several approaches which directly treat the
multivariate data; see Mancini and Gobbi [33] and Boudt et al. [12] for example.
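
As a purely illustrative aside, the following Python sketch shows how the realized covariance in (1.1) picks up both terms on the right-hand side of (1.2) under synchronous, noise-free sampling. The function name `realized_covariance` and all numerical values (sample size, correlation 0.5, a single common jump with sizes 0.3 and -0.2) are assumptions made for this illustration only.

```python
import numpy as np

def realized_covariance(z1, z2):
    """Sum of products of synchronous increments, as in the left-hand side of (1.1)."""
    z1 = np.asarray(z1, dtype=float)
    z2 = np.asarray(z2, dtype=float)
    return float(np.sum(np.diff(z1) * np.diff(z2)))

# Two correlated Brownian motions on [0, 1] (toy parameters, assumed for the sketch).
rng = np.random.default_rng(0)
n, rho = 100_000, 0.5
dt = 1.0 / n
dw1 = rng.normal(0.0, np.sqrt(dt), n)
dw2 = rho * dw1 + np.sqrt(1.0 - rho**2) * rng.normal(0.0, np.sqrt(dt), n)
x1 = np.concatenate([[0.0], np.cumsum(dw1)])
x2 = np.concatenate([[0.0], np.cumsum(dw2)])

# Add one common jump at time 1/2; the co-jump term in (1.2) is 0.3 * (-0.2) = -0.06.
jump = (np.arange(n + 1) >= n // 2).astype(float)
z1 = x1 + 0.3 * jump
z2 = x2 - 0.2 * jump

rc_continuous = realized_covariance(x1, x2)  # estimates the integrated covariance (rho here)
rc_with_jumps = realized_covariance(z1, z2)  # additionally contains the co-jump term
```

The difference `rc_with_jumps - rc_continuous` is close to the co-jump product, which is exactly the component that the estimator studied in this paper is designed to remove.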

In real financial markets, however, we are confronted with difficulties caused by the so-called market microstructure.
In the present context there are two major topics related to them: one is the nonsynchronicity of observation times
and the other is a kind of observation error called microstructure noise. In recent years the simultaneous treatment
of these two problems, which is based on combining methods for dealing with each individual one, has been
established by many authors in the case that jumps are absent. See Barndorff-Nielsen et al. [5], Bibinger [10] and
Christensen et al. [13] for example. Furthermore, methodologies for detecting jumps in the presence of one of the
above issues have also been studied in the literature. In the case that observation times are nonsynchronous,
Mancini and Gobbi [33] combined the Hayashi-Yoshida method proposed in Hayashi and Yoshida [22] (to deal with
the nonsynchronicity) with the thresholding technique proposed independently in Mancini [32] and Shimizu [40] (to
detect jumps) and constructed a consistent estimator for the integrated covariance. That estimator, which we call
the truncated Hayashi-Yoshida estimator, was also studied in [30]. On the other hand, Podolskij and Vetter [38]
proposed a new method for dealing with microstructure noise and introduced a class of bipower-type statistics
which works well in the presence of microstructure noise. Their method is now called the pre-averaging method
and has been further investigated in [25] and [38] for example. Fan and Wang [19] proposed another approach for
detecting jumps in the presence of microstructure noise, where wavelet methods were applied to detect jumps. Their
approach has further been developed by [8]. In contrast, we remark that relatively few papers are so far available
in a parametric setting with respect to these topics; we refer interested readers to Ogihara and Yoshida [35].

In the present article we investigate a methodology suited to the situation in which all of the above
problems are present simultaneously. That is, we consider two Itô semimartingales which are observed at stopping
times in a nonsynchronous manner and contaminated by noise. Then we develop a method for estimating their
integrated covariance separately from the sum of their co-jumps. For this purpose, we combine the Hayashi-Yoshida
method (to deal with the nonsynchronicity of the observation times) and the pre-averaging method (to
remove the noise) with the threshold technique (to separate the jumps) and consider a class of statistics called the
pre-averaged truncated Hayashi-Yoshida estimator. We prove the consistency and the asymptotic mixed normality
of the pre-averaged truncated Hayashi-Yoshida estimator in a very general setting allowing the presence of
infinite activity jumps, some dependency between the observation times and the observed processes, as well as a
kind of endogenous noise.

This paper is organized as follows. In Section 2 we briefly review the results on the asymptotic properties
of the pre-averaged Hayashi-Yoshida estimator in the continuous Itô semimartingale setting. In Section 3 we present
the construction of our estimator and the main results of this paper. We discuss some topics related to the statistical
application of our estimator to finance in Section 4, while Section 5 provides some numerical experiments to illustrate
the finite sample properties of our estimator. Most of the proofs are postponed to the Appendix.

2 A brief review of the continuous case 

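We start by introducing an appropriate stochastic basis on which our observation data is defined. Let
$\mathcal{B}^{(0)} = (\Omega^{(0)}, \mathcal{F}^{(0)}, \mathbf{F}^{(0)} = (\mathcal{F}^{(0)}_t)_{t\in\mathbb{R}_+}, P^{(0)})$ be a stochastic basis. For any $t\in\mathbb{R}_+$ we have a
transition probability $Q_t(\omega^{(0)}, dz)$ from $(\Omega^{(0)}, \mathcal{F}^{(0)}_t)$ into $\mathbb{R}^2$, which satisfies

$$\int z\,Q_t(\omega^{(0)}, dz) = 0.$$

We endow the space $\Omega^{(1)} = (\mathbb{R}^2)^{[0,\infty)}$ with the product Borel $\sigma$-field $\mathcal{F}^{(1)}$ and with the probability
$Q(\omega^{(0)}, d\omega^{(1)})$ which is the product $\otimes_{t\in\mathbb{R}_+} Q_t(\omega^{(0)}, \cdot)$. We also call $(\zeta_t)_{t\in\mathbb{R}_+}$ the "canonical process" on
$(\Omega^{(1)}, \mathcal{F}^{(1)})$ and define the filtration $\mathcal{F}^{(1)}_t = \sigma(\zeta_s; s\le t)$. Then we consider the stochastic basis
$\mathcal{B} = (\Omega, \mathcal{F}, \mathbf{F} = (\mathcal{F}_t)_{t\in\mathbb{R}_+}, P)$ defined as follows:

$$\Omega = \Omega^{(0)}\times\Omega^{(1)},\qquad \mathcal{F} = \mathcal{F}^{(0)}\otimes\mathcal{F}^{(1)},\qquad \mathcal{F}_t = \bigcap_{s>t}\mathcal{F}^{(0)}_s\otimes\mathcal{F}^{(1)}_s,$$

$$P(d\omega^{(0)}, d\omega^{(1)}) = P^{(0)}(d\omega^{(0)})\,Q(\omega^{(0)}, d\omega^{(1)}).$$

Any variable or process which is defined on either $\Omega^{(0)}$ or $\Omega^{(1)}$ can be considered in the usual way as a variable or
a process on $\Omega$.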

Next we introduce our observation data. There are two continuous semimartingales $X^1 = (X^1_t)_{t\in\mathbb{R}_+}$ and
$X^2 = (X^2_t)_{t\in\mathbb{R}_+}$ on $\mathcal{B}^{(0)}$ with canonical decompositions

$$X^l = A^l + M^l,\qquad l = 1,2, \qquad (2.1)$$

where $A^1$ and $A^2$ are continuous $\mathbf{F}^{(0)}$-adapted processes with locally finite variations, while $M^1$ and $M^2$ are
continuous $\mathbf{F}^{(0)}$-local martingales. We have two sequences of $\mathbf{F}^{(0)}$-stopping times $(S^i)_{i\in\mathbb{Z}_+}$ and $(T^j)_{j\in\mathbb{Z}_+}$ that are
increasing a.s.,

$$S^i\uparrow\infty \quad\text{and}\quad T^j\uparrow\infty. \qquad (2.2)$$

As a matter of convenience we set $S^{-1} = T^{-1} = 0$. These stopping times implicitly depend on a parameter $n\in\mathbb{N}$,
which represents the frequency of the observations. Denote by $(b_n)$ a sequence of positive numbers tending to 0 as
$n\to\infty$ (typically $b_n = n^{-1}$). Let $\xi'$ be a constant satisfying $0 < \xi' < 1$. In this paper, we will always assume that

$$r_n(t) := \sup_{i\in\mathbb{Z}_+}(S^i\wedge t - S^{i-1}\wedge t)\vee\sup_{j\in\mathbb{Z}_+}(T^j\wedge t - T^{j-1}\wedge t) = o_p(b_n^{\xi'}) \qquad (2.3)$$

as $n\to\infty$ for any $t\in\mathbb{R}_+$.

The processes $X^1$ and $X^2$ are observed at the sampling times $(S^i)$ and $(T^j)$ with observation errors $(U^1_{S^i})_{i\in\mathbb{Z}_+}$
and $(U^2_{T^j})_{j\in\mathbb{Z}_+}$ respectively. In this paper, we assume that the observation errors have the following representations:

$$U^1_{S^i} = b_n^{-1/2}(\underline{X}^1_{S^i} - \underline{X}^1_{S^{i-1}}) + \zeta^1_{S^i},\qquad U^2_{T^j} = b_n^{-1/2}(\underline{X}^2_{T^j} - \underline{X}^2_{T^{j-1}}) + \zeta^2_{T^j}. \qquad (2.4)$$

Here, $\zeta_t = (\zeta^1_t, \zeta^2_t)$ for each $t$, while $\underline{X}^1$ and $\underline{X}^2$ are two continuous semimartingales on $\mathcal{B}^{(0)}$. After all, we have the
observation data $\mathsf{X}^1 = (\mathsf{X}^1_{S^i})_{i\in\mathbb{Z}_+}$ and $\mathsf{X}^2 = (\mathsf{X}^2_{T^j})_{j\in\mathbb{Z}_+}$ of the form

$$\mathsf{X}^1_{S^i} = X^1_{S^i} + U^1_{S^i},\qquad \mathsf{X}^2_{T^j} = X^2_{T^j} + U^2_{T^j}.$$

Our aim is to estimate the integrated covariance $[X^1, X^2]_t$ of $X^1$ and $X^2$ at any time $t\in\mathbb{R}_+$ from the observation
data $(\mathsf{X}^1_{S^i})_{i:S^i\le t}$ and $(\mathsf{X}^2_{T^j})_{j:T^j\le t}$. It is necessary to deal with both the observation noise and the nonsynchronicity
of the observation times simultaneously. As mentioned in the introduction, we use the pre-averaging technique to
remove the noise, while we use the Hayashi-Yoshida method to deal with the nonsynchronicity. For the pre-averaging
technique we introduce some notation. We choose a sequence $k_n$ of integers and a number $\theta\in(0,\infty)$ satisfying

$$k_n = \theta b_n^{-1/2} + o(b_n^{-1/4}) \qquad (2.5)$$

(for example $k_n = \lceil\theta b_n^{-1/2}\rceil$). We also choose a continuous function $g : [0,1]\to\mathbb{R}$ which is piecewise $C^1$ with a
piecewise Lipschitz derivative $g'$ and satisfies

$$g(0) = g(1) = 0,\qquad \psi_{HY} := \int_0^1 g(x)\,dx \neq 0 \qquad (2.6)$$

(for example $g(x) = x\wedge(1-x)$). We associate the random intervals $I^i = [S^{i-1}, S^i)$ and $J^j = [T^{j-1}, T^j)$ with the
sampling schemes $(S^i)$ and $(T^j)$ and refer to $\mathcal{I} = (I^i)_{i\in\mathbb{N}}$ and $\mathcal{J} = (J^j)_{j\in\mathbb{N}}$ as the sampling designs for $X^1$ and $X^2$.
We introduce the pre-averaging observation data of $X^1$ and $X^2$ based on the sampling designs $\mathcal{I}$ and $\mathcal{J}$ respectively
as follows:



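$$\overline{\mathsf{X}^1}(\mathcal{I})^i = \sum_{p=1}^{k_n-1} g\Bigl(\frac{p}{k_n}\Bigr)\bigl(\mathsf{X}^1_{S^{i+p}} - \mathsf{X}^1_{S^{i+p-1}}\bigr),\qquad \overline{\mathsf{X}^2}(\mathcal{J})^j = \sum_{q=1}^{k_n-1} g\Bigl(\frac{q}{k_n}\Bigr)\bigl(\mathsf{X}^2_{T^{j+q}} - \mathsf{X}^2_{T^{j+q-1}}\bigr),\qquad i,j = 0,1,\dots.$$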



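The following quantity was introduced in Christensen et al. [13].

Definition 2.1 (Pre-averaged Hayashi-Yoshida estimator). The pre-averaged Hayashi-Yoshida estimator, or
pre-averaged HY estimator, of $\mathsf{X}^1$ and $\mathsf{X}^2$ associated with sampling designs $\mathcal{I}$ and $\mathcal{J}$ is the process

$$PHY(\mathsf{X}^1,\mathsf{X}^2;\mathcal{I},\mathcal{J})^n_t = \frac{1}{(\psi_{HY}k_n)^2}\sum_{\substack{i,j=0\\ S^{i+k_n}\vee T^{j+k_n}\le t}}^{\infty}\overline{\mathsf{X}^1}(\mathcal{I})^i\,\overline{\mathsf{X}^2}(\mathcal{J})^j\,1_{\{[S^i,S^{i+k_n})\cap[T^j,T^{j+k_n})\neq\emptyset\}},\qquad t\in\mathbb{R}_+.$$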
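
To make Definition 2.1 concrete, here is a minimal Python sketch of the pre-averaged HY statistic evaluated at the end of the sample. It assumes the usual pre-averaging form in which the $i$-th pre-averaged increment is $\sum_{p=1}^{k-1} g(p/k)$ times the corresponding observed increments, the example weight $g(x) = x\wedge(1-x)$ (so that $\psi_{HY} = 1/4$), and strictly increasing observation times. The function names `pre_average` and `phy`, as well as every simulation parameter below, are hypothetical choices made for the illustration and are not taken from the paper.

```python
import numpy as np

def g(x):
    """Example weight function from the text: g(x) = min(x, 1 - x)."""
    return np.minimum(x, 1.0 - x)

def pre_average(obs, k):
    """Pre-averaged increments sum_{p=1}^{k-1} g(p/k) (obs[i+p] - obs[i+p-1])."""
    obs = np.asarray(obs, dtype=float)
    w = g(np.arange(1, k) / k)   # weights g(1/k), ..., g((k-1)/k)
    d = np.diff(obs)             # d[m] = obs[m+1] - obs[m]
    return np.array([w @ d[i:i + k - 1] for i in range(len(obs) - k + 1)])

def phy(s, x, t, y, k, psi=0.25):
    """Pre-averaged HY statistic at the last observation time (psi = int_0^1 g = 1/4)."""
    s, t = np.asarray(s, dtype=float), np.asarray(t, dtype=float)
    xb = pre_average(x, k)[: len(s) - k]   # keep indices i for which S^{i+k} exists
    yb = pre_average(y, k)[: len(t) - k]
    # overlap[i, j] is True when [S^i, S^{i+k}) and [T^j, T^{j+k}) intersect
    overlap = (s[:-k, None] < t[None, k:]) & (t[None, :-k] < s[k:, None])
    return float(xb @ overlap.astype(float) @ yb) / (psi * k) ** 2

# Toy data: correlated Brownian motions, nonsynchronous sampling, additive i.i.d. noise.
rng = np.random.default_rng(1)
n_grid, rho, noise_sd = 20_000, 0.5, 0.005
dw1 = rng.normal(0.0, np.sqrt(1.0 / n_grid), n_grid)
dw2 = rho * dw1 + np.sqrt(1.0 - rho**2) * rng.normal(0.0, np.sqrt(1.0 / n_grid), n_grid)
grid = np.arange(n_grid + 1) / n_grid
p1 = np.concatenate([[0.0], np.cumsum(dw1)])
p2 = np.concatenate([[0.0], np.cumsum(dw2)])
i1 = np.sort(rng.choice(n_grid + 1, 2000, replace=False))  # observation indices of X^1
i2 = np.sort(rng.choice(n_grid + 1, 2000, replace=False))  # observation indices of X^2
obs1 = p1[i1] + noise_sd * rng.normal(size=i1.size)
obs2 = p2[i2] + noise_sd * rng.normal(size=i2.size)
estimate = phy(grid[i1], obs1, grid[i2], obs2, k=30)
```

The overlap matrix is built by broadcasting rather than a double loop; for a few thousand observations per asset this is cheap, whereas for much longer samples a banded or two-pointer implementation would be preferable.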

For a technical reason explained in [31], we modify the above estimator as follows. The following notion was 
introduced to this area in Barndorff-Nielsen et al. [5]: 

Definition 2.2 (Refresh time). The first refresh time of sampling designs $\mathcal{I}$ and $\mathcal{J}$ is defined as $R^0 = S^0\vee T^0$, and
then subsequent refresh times as

$$R^k := \min\{S^i \mid S^i > R^{k-1}\}\vee\min\{T^j \mid T^j > R^{k-1}\},\qquad k = 1,2,\dots.$$
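We introduce new sampling schemes by a kind of next-tick interpolation to the refresh times. That is, we
define $\hat{S}^0 := S^0$, $\hat{T}^0 := T^0$, and

$$\hat{S}^k := \min\{S^i \mid S^i > R^{k-1}\},\qquad \hat{T}^k := \min\{T^j \mid T^j > R^{k-1}\},\qquad k = 1,2,\dots.$$

Then, we create new sampling designs as follows:

$$\hat{I}^k := [\hat{S}^{k-1}, \hat{S}^k),\qquad \hat{J}^k := [\hat{T}^{k-1}, \hat{T}^k),\qquad \hat{\mathcal{I}} := (\hat{I}^i)_{i\in\mathbb{N}},\qquad \hat{\mathcal{J}} := (\hat{J}^j)_{j\in\mathbb{N}}.$$

For the sampling designs $\hat{\mathcal{I}}$ and $\hat{\mathcal{J}}$ obtained in such a manner, we will consider the pre-averaged HY estimator
$PHY(\mathsf{X}^1,\mathsf{X}^2)^n := PHY(\mathsf{X}^1,\mathsf{X}^2;\hat{\mathcal{I}},\hat{\mathcal{J}})^n$.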
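
The refresh-time construction can be sketched in a few lines of Python. The sketch below assumes strictly increasing arrays of observation times and takes the next-tick rule to be "the first observation strictly after the previous refresh time"; the function names `refresh_times` and `next_tick` and the toy time stamps are hypothetical.

```python
import numpy as np

def refresh_times(s, t):
    """R^0 = max(S^0, T^0); R^k = max(first S^i > R^{k-1}, first T^j > R^{k-1})."""
    s, t = np.asarray(s, dtype=float), np.asarray(t, dtype=float)
    r = [max(s[0], t[0])]
    while True:
        i = np.searchsorted(s, r[-1], side="right")  # index of first S^i > R^{k-1}
        j = np.searchsorted(t, r[-1], side="right")  # index of first T^j > R^{k-1}
        if i == len(s) or j == len(t):               # one series has no further tick
            break
        r.append(max(s[i], t[j]))
    return np.array(r)

def next_tick(times, r):
    """New times: first point unchanged, then the first tick strictly after each R^{k-1}."""
    times = np.asarray(times, dtype=float)
    idx = np.searchsorted(times, r[:-1], side="right")
    return np.concatenate([[times[0]], times[idx]])

# Toy example with hand-picked time stamps.
s = [0.0, 0.1, 0.2, 0.4, 0.5, 0.9]
t = [0.0, 0.3, 0.35, 0.6, 1.0]
r = refresh_times(s, t)
s_hat = next_tick(s, r)
t_hat = next_tick(t, r)
```

By construction the larger of `s_hat[k]` and `t_hat[k]` equals `r[k]`, so both interpolated series have exactly one observation per refresh interval, which is what allows the two series to be indexed by the common counter $k$.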

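Now we review the results related to the consistency and the asymptotic mixed normality of the estimator
$PHY(\mathsf{X}^1,\mathsf{X}^2)^n$. We write the canonical decompositions of $\underline{X}^1$ and $\underline{X}^2$ as follows:

$$\underline{X}^l = \underline{A}^l + \underline{M}^l,\qquad l = 1,2. \qquad (2.7)$$

Here, $\underline{A}^1$ and $\underline{A}^2$ are continuous $\mathbf{F}^{(0)}$-adapted processes with locally finite variations, while $\underline{M}^1$ and $\underline{M}^2$ are
continuous $\mathbf{F}^{(0)}$-local martingales. Next, let $N^n_t = \sum_{k=1}^{\infty}1_{\{R^k\le t\}}$ for each $t\in\mathbb{R}_+$ and we introduce the following
regularity conditions: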

[C1] $b_nN^n_t = O_p(1)$ as $n\to\infty$ for every $t$.

[C2] $A^1$, $A^2$, $\underline{A}^1$, $\underline{A}^2$, and $[V,W]$ for $V,W = X^1, X^2, \underline{X}^1, \underline{X}^2$ are absolutely continuous with locally bounded
derivatives.

Furthermore, for every $r\in[2,\infty)$ we introduce the following regularity condition for noise:
[N$^\flat_r$] $(\int|z|^r Q_t(dz))_{t\in\mathbb{R}_+}$ is a locally bounded process.

A sequence $(X^n)$ of stochastic processes is said to converge to a process $X$ uniformly on compacts in probability
(abbreviated ucp) if, for each $t > 0$, $\sup_{0\le s\le t}|X^n_s - X_s|\to^p 0$ as $n\to\infty$. We then write $X^n\xrightarrow{ucp}X$. We have the
following result about the consistency of the pre-averaged Hayashi-Yoshida estimator:

Theorem 2.1 ([31], Theorem 5.1). Suppose (2.3), [C1]-[C2] and [N^ are satisfied. Then 
$$PHY(\mathsf{X}^1,\mathsf{X}^2)^n\xrightarrow{ucp}[X^1,X^2]$$
as $n\to\infty$, provided that $\xi' > 1/2$.

The consistency of the pre-averaged HY estimator was first shown in Christensen et al. [13] in a simpler situation. 

Next we review the results related to the asymptotic mixed normality of the pre-averaged HY estimator, which 
was first proven in [15] when the sampling times are deterministic transformation of equidistant ones. In this paper 
we treat general sampling times, so that we review the result given in [31]. 

Let 7V"'^ = J2T=i l{s'=<t} aiid Nl'^^ = J2T=i l{f'=<t} foi' each t e 



and 



for each fc € N. Here, for each t G IR+ wc write S = sup^i^t. S*' and T = sup^^j T^. Note that and 
may not be stopping times. 

Let ^ be a positive constant satisfying i < ^ < 1. Furthermore, let H" = (H")tgR_^ be a sequence of filtrations 
of J" to which iV", A^"'^ and 7V"'^ are adapted, and for each n and each p > we define the processes x", G'(p)", 



J^(p)"'i, i^(p)"'2 and F(l)^ 



Xs 

Hp):-' 



by 



E 



{b-'\T^\Y\ni.-, 



F(i)';-^*^ = b~'E [\r n r 



F{p)-^^ = E[{b-^\P\Y\ni,_, 



when s € r'^. 

The following condition is necessary to compute the asymptotic variance of the estimation error of our estimator
explicitly. For a sequence $(X^n)$ of cadlag processes and a cadlag process $X$, we write $X^n\xrightarrow{Sk.p.}X$ if $(X^n)$ converges
to $X$ in probability for the Skorokhod topology.
[Al'] (i) For each n, we have a cadlag H"-adapted process G" and a random subset A/"" of N such that (#A/'")„gN
is tight, G(l)^fc_i = for any fc S N — A/"", and there exists a cadlag F'°)-adapted process G satisfying 

Sk p 

that G and G_ do not vanish and that G" — ^ G as 71 — > oo. 

(ii) There exists a constant p > 1/^' such that (supq<^<j G(p)")^gj^ is tight for all t > 0. 

(iii) For each n, we have a cadlag H"-adapted process and a random subset N!^ of N such that {i^M^)n£N 
is tight, X^k-i = x'l^k-i for any fc e N — A/"^, and there exists a cadlag F^'^^-adapted process x such that 

, Sk.p. 

X > X as n — > cx). 

(iv) For each n and Z = 1, 2, 1 * 2, we have a cadlag H"-adapted process i^"'' and a random subset A/"^ of N 
such that (#A/'^)nGN is tight, F{l)^l_i = F'^k-i for any fc G N — A/"^, and there exists a cadlag F^^^-adapted 
processes satisfying i^"'' ^'^'^'> i^' as n 00. 

(v) There exists a constant p' > 1/^' such that (supo<s<t ^(p')"'')rieN tight for alH > and Z = 1, 2. 
The following condition is a sufficient one for the condition [Al']: 

[Al'l*] (i) For every p e [0,1/^'] there exists a cadlag F^^^-adapted process G{p) such that G{py^ Sk.p.^ G{p) as 
ri — >■ cxj. Furthermore, G and G_ do not vanish, where G = G(l). 

(ii) There exists a cadlag F'"'-adapted process x such that x" ^'^^ > x as n — >■ 00. 

(iii) For every Z = 1,2 and every p' € [0,1/^'], there exists a cadlag F^^^-adapted process -F(p)' such that 
F(p)"'' F{pY as n 00. 

(iv) There exists a cadlag F(°)-adapted process F(l)i*2 such that i^(l)".i*2 -^i£:> F(l)i*2 as n ^ 00. 

Remark 2.1. An [Al'l*] type condition appears in [5] and [21], for example. The reason why we introduce a kind 
of exceptional sets A/"^ {I = 0, 1, 2, 1 * 2,' ) is that the condition [Al'] without them is too local. To explain this, we 
focus on the univariate case. Note that in this case we have i?'^ ^ (fc = 0, 1, 2, . . . ). Let r be a positive number 
and suppose that (S**) be a sequence of Poisson arrival times whose intensity is A before the time t and A after r. 
Then the structure of the process G(l)" becomes very complex around the time r (of course if A ^ A), so that it 

will be difficult to verify the convergence G(l)" G because it requires a kind of uniformity. See Section 5.2 of 

[31] for more precise discussion. 

Next, we introduce the following strong predictability condition for the sampling designs, which is an analog to 
the condition [A2] in [23]. 

[A2] For every n,i G N, and T* are G("^-stopping times, where G^"^ = (5t"'')teM+ is the filtration given by 

The following conditions are analogs to the conditions [A3] and [A4] in [23]: 

[A3] For each V,W = , , X} , Ji^ , [V, W] is absolutely continuous with a cadlag derivative, and for the density 
process / = [V, W]' there is a sequence (ak) of F'^°) -stopping times such that for every k and any A > 0, we 
have a positive constant Ck,\ satisfying 

E [I/.? - /.?|'|-^r,Ar.] < Ck,xE [In - r2|i-^|j-,,^.,] (2.8) 

for any bounded F'^'^^-stopping times ri and T2, and / is adapted to H". 
[A4] C V ^ < ^' and (2.3) holds for every < e M+. 

The following conditions, which are analogs to the conditions [A5] and [A6] in [23], are necessary to deal with 
the drift parts: 

[A5] , A^, a} and are absolutely continuous with cadlag derivatives, and there is a sequence (cr^.) of F^*''- 
stopping times such that for every k we have a positive constant Gk and Xk S (0,3/4) satisfying 

E [|/f'= - /^"l^l^rM] < CkE [\t - t|1-^'= (2.9) 

for every t > Q and any bounded F'^'-stopping time r, for the density processes / = {A^)' , {A^)' , {A^)' and 

[A6] For each t e K+, b-^Hn{t) = Op(l) as n c», where i?„(t) = J^kLi |r''(i)P- 

Let r e [2, oo). The following condition is a regularity condition for the noise process: 
[Nr] (/ |2:|''(3t(dz))tgK+ is a locally bounded process, and the covariance matrix process 

^t(c^(o)) = / z2*Qt(w(°\dz). (2.10) 
is cadlag and quasi-left continuous. Furthermore, there is a sequence $(\sigma^k)$ of $\mathbf{F}^{(0)}$-stopping times such that
for every $k$ and any $\lambda > 0$, we have a positive constant $C_{k,\lambda}$ satisfying



E 



< Ck.xh'-^ (2.11) 



for every i,j € {1, 2} and every t,h > 0. 

Remark 2.2. The inequahties (2.8), (2.9) and (2.11) are satisfied when w{f;h,t) = Op{h^~'^) as ft. — oo for 
every t,X £ (0, oo), for example. Here, for a real- valued function a; on R+, the modulus of continuity on [0,T] is 
denoted by w{x] (5, T) = sup{|a;(i) — x{s)\] s,t e [0, T], \s — t\ < S} for T,S > 0. This is the original condition in [23]. 
Another such example is the case that there exist an $\mathbf{F}^{(0)}$-adapted process $B$ with a locally integrable variation
and a locally square-integrable martingale $L$ such that $f = B + L$ and both the predictable compensator of the
variation process of $B$ and the predictable quadratic variation of $L$ are absolutely continuous with locally bounded
derivatives. This type of condition is familiar in the context of the estimation of volatility-type quantities; see [21]
and [26] for example. Furthermore, in both cases $f$ is cadlag and quasi-left continuous.

We extend the functions $g$ and $g'$ to the whole real line by setting $g(x) = g'(x) = 0$ for $x\notin[0,1]$. Then we put

2 j.2 j-2 



i^ -= I i'gA^) dx, K = I il}g>^g,{x) dx, K := / i}}g,g'{x) dx. 

J-2 J ~2 J -2 

We denote by $\mathbb{D}(\mathbb{R}_+)$ the space of cadlag functions on $\mathbb{R}_+$ equipped with the Skorokhod topology. A sequence $(X^n)$ of
random elements defined on a probability space $(\Omega,\mathcal{F},P)$ is said to converge stably in law to a random element
$X$ defined on an appropriate extension $(\tilde\Omega,\tilde{\mathcal{F}},\tilde{P})$ of $(\Omega,\mathcal{F},P)$ if $E[Yg(X^n)]\to\tilde{E}[Yg(X)]$ for any $\mathcal{F}$-measurable and
bounded random variable $Y$ and any bounded and continuous function $g$. We then write $X^n\xrightarrow{d_s}X$.

Now we are ready to state the result related to the asymptotic mixed normality of the pre-averaged HY estimator. 

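Theorem 2.2 ([31], Theorem 3.1). (a) Suppose [A1'](i)-(iii), [A2]-[A6] and [N$_8$] are satisfied. Suppose also
$\underline{X}^1 = \underline{X}^2 = 0$. Then

$$b_n^{-1/4}\{PHY(\mathsf{X}^1,\mathsf{X}^2)^n - [X^1,X^2]\}\xrightarrow{d_s}\int_0^{\cdot}w_s\,dW_s\quad\text{in }\mathbb{D}(\mathbb{R}_+)$$

as $n\to\infty$, where $W$ is a one-dimensional standard Wiener process (defined on an extension of $\mathcal{B}$) independent of
$\mathcal{F}$ and $w$ is given by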

+ e-'K{[xX^f + [x%^l' + 2[x\x%^l\S- (2.12) 

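(b) Suppose [A1'], [A2]-[A6] and [N$_8$] are satisfied. Then

$$b_n^{-1/4}\{PHY(\mathsf{X}^1,\mathsf{X}^2)^n - [X^1,X^2]\}\xrightarrow{d_s}\int_0^{\cdot}w_s\,dW_s\quad\text{in }\mathbb{D}(\mathbb{R}_+)$$

as $n\to\infty$, where $W$ is as above and $w$ is given by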

+ e-'T^{[X'l-^f + [XX'^1' + 2[X\XX'^1' - {[X\X%Fl [X\X'IF!)'} , (2.13) 
where = ¥j + [X'j'.Fj {I = 1, 2) and *f = ^i^^, + [X\xXf}*\ 



3 Main results 



In this section we investigate the case that the latent processes possibly have jumps. Let $Z^1 = (Z^1_t)_{t\in\mathbb{R}_+}$ and
$Z^2 = (Z^2_t)_{t\in\mathbb{R}_+}$ be two stochastic processes on $(\Omega^{(0)},\mathcal{F}^{(0)},P^{(0)})$. We have the observation data
$\mathsf{Z}^1 = (\mathsf{Z}^1_{S^i})_{i\in\mathbb{Z}_+}$ and $\mathsf{Z}^2 = (\mathsf{Z}^2_{T^j})_{j\in\mathbb{Z}_+}$ of $Z^1$ and $Z^2$ contaminated by noise:

$$\mathsf{Z}^1_{S^i} = Z^1_{S^i} + U^1_{S^i},\qquad \mathsf{Z}^2_{T^j} = Z^2_{T^j} + U^2_{T^j}.$$

Here, the observation noise $(U^1_{S^i})_{i\in\mathbb{Z}_+}$ and $(U^2_{T^j})_{j\in\mathbb{Z}_+}$ are given by (2.4).

The idea for the construction of our estimator is as follows. The pre-averaging procedure smooths the noise and
thus we can expect the pre-averaged data $\overline{\mathsf{Z}^1}(\hat{\mathcal{I}})^i$ and $\overline{\mathsf{Z}^2}(\hat{\mathcal{J}})^j$ to be small enough if they contain no jumps. This idea
has already appeared in Aït-Sahalia et al. [1] and Podolskij and Ziggel [39] in the univariate case and Jing et al.
[28] in the synchronous case. Following this idea, we introduce the following quantity:

Definition 3.1 (Pre-averaged truncated Hayashi-Yoshida estimator). The pre-averaged truncated Hayashi-Yoshida
estimator, or PTHY estimator, of two observation data $\mathsf{Z}^1$ and $\mathsf{Z}^2$ is the process

PTW(Z\Z2)^ 

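where

$$\overline{\mathsf{Z}^1}(\hat{\mathcal{I}})^i = \sum_{p=1}^{k_n-1} g\Bigl(\frac{p}{k_n}\Bigr)\bigl(\mathsf{Z}^1_{\hat{S}^{i+p}} - \mathsf{Z}^1_{\hat{S}^{i+p-1}}\bigr),\qquad \overline{\mathsf{Z}^2}(\hat{\mathcal{J}})^j = \sum_{q=1}^{k_n-1} g\Bigl(\frac{q}{k_n}\Bigr)\bigl(\mathsf{Z}^2_{\hat{T}^{j+q}} - \mathsf{Z}^2_{\hat{T}^{j+q-1}}\bigr),\qquad i,j = 0,1,\dots,$$

$\bar{K}^{ij} = 1_{\{[\hat{S}^i,\hat{S}^{i+k_n})\cap[\hat{T}^j,\hat{T}^{j+k_n})\neq\emptyset\}}$ and $(\varrho^l_n(t))_{n\in\mathbb{N}}$, $l = 1,2$, are two sequences of positive-valued stochastic processes.

We will write $\varrho^1_n[i] := \varrho^1_n(\hat{S}^i)$ and $\varrho^2_n[j] := \varrho^2_n(\hat{T}^j)$ for short. The above statistic was originally considered in
[30] without the refresh sampling modification. Recently Wang et al. [45] also introduced such a statistic. In the
present article we include this modification in order to obtain the central limit theorem for the estimator with a
broad class of sampling schemes by using Theorem 2.2.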
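
The effect of truncation can be illustrated with a small Python sketch in the synchronous case, where the overlap indicator reduces to $|i-j| < k_n$. The sketch rests on two assumptions that should be read as such: a pre-averaged increment is discarded when its square exceeds the corresponding threshold (the usual convention for threshold estimators), and the weight function is $g(x) = x\wedge(1-x)$ with $\psi_{HY} = 1/4$. The names `pre_average` and `pthy_sync`, the thresholds and the toy paths are hypothetical.

```python
import numpy as np

def pre_average(obs, k):
    """Pre-averaged increments with the example weight g(x) = min(x, 1 - x)."""
    obs = np.asarray(obs, dtype=float)
    u = np.arange(1, k) / k
    w = np.minimum(u, 1.0 - u)
    d = np.diff(obs)
    return np.array([w @ d[i:i + k - 1] for i in range(len(obs) - k + 1)])

def pthy_sync(z1, z2, k, thr1, thr2, psi=0.25):
    """Truncated pre-averaged HY statistic for synchronous data (illustrative only)."""
    m = len(z1) - k
    zb1 = pre_average(z1, k)[:m]
    zb2 = pre_average(z2, k)[:m]
    # ASSUMPTION: drop a pre-averaged increment whose square exceeds its threshold.
    zb1 = np.where(zb1 ** 2 <= thr1, zb1, 0.0)
    zb2 = np.where(zb2 ** 2 <= thr2, zb2, 0.0)
    i, j = np.indices((m, m))
    overlap = (np.abs(i - j) < k).astype(float)  # synchronous case: windows overlap iff |i - j| < k
    return float(zb1 @ overlap @ zb2) / (psi * k) ** 2

# Toy paths: a linear trend, with one large jump added to the first series.
grid = np.arange(10.0)
z1 = grid + 100.0 * (grid >= 5)
z2 = grid.copy()
no_truncation = pthy_sync(z1, z2, k=4, thr1=np.inf, thr2=np.inf)
with_truncation = pthy_sync(z1, z2, k=4, thr1=4.0, thr2=4.0)
```

In this toy example every pre-averaging window that contains the jump is removed by the threshold, so `with_truncation` is far smaller than `no_truncation`; thresholds may also be passed as arrays to mimic time-varying processes $\varrho^l_n$.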



3.1 Finite activity jump case

First we consider the case that the observed processes have at most finitely many jumps on each compact interval.
We assume the following structural assumption:

[F] For each $l = 1,2$ we have $Z^l_t = X^l_t + \sum_{k=1}^{N^l_t}\gamma^l_k$, where $X^l$ is a continuous semimartingale on $\mathcal{B}^{(0)}$ given by (2.1),
$N^l$ is a (simple) point process adapted to $\mathbf{F}^{(0)}$, and $(\gamma^l_k)_{k\in\mathbb{N}}$ is a sequence of non-zero random variables.

Moreover, we impose the following condition on the threshold processes: 

[T] $\xi' > 1/2$, and for each $l = 1,2$ we have $\varrho^l_n(t) = \alpha^l_n(t)\rho_n$, where

(i) $(\rho_n)_{n\in\mathbb{N}}$ is a sequence of (deterministic) positive numbers satisfying $\rho_n\to0$ and

^t^^^lM^O (3.1) 

Pn 

as n — > oo. 

(ii) $(\alpha^l_n(t))_{n\in\mathbb{N}}$ is a sequence of (not necessarily adapted) positive-valued stochastic processes. Moreover,
there exists a sequence $(R^l_k)$ of stopping times (with respect to $\mathbf{F}$) such that $R^l_k\uparrow\infty$ and both of the
sequences $(\sup_{0\le t<R^l_k}\alpha^l_n(t))_{n\in\mathbb{N}}$ and $(\sup_{0\le t<R^l_k}[1/\alpha^l_n(t)])_{n\in\mathbb{N}}$ are tight for all $k$.
Then we obtain the following theorem. 

Theorem 3.1. Suppose [F], [T], [C1]-[C2] and [N$^\flat_r$] hold for some $r\in(2,\infty)$. Then we have
sup |PTW(Z\z2)^-iW(X\x2)^|=Op(6y4) + Opf(6y2p-i 

0<s<t \^ 

as $n\to\infty$ for any $t > 0$.

Proof of this theorem is given in Appendix A. Combining this result with Theorem 2.1 or Theorem 2.2, we 
obtain the following results: 

Theorem 3.2 (Consistency of the PTHY estimator in finite activity case). Suppose [F], [T], [C1]-[C2] and [N$^\flat_r$]
hold for some $r\in(2,\infty)$. Then we have

$$PTHY(\mathsf{Z}^1,\mathsf{Z}^2)^n\xrightarrow{ucp}[X^1,X^2] \qquad (3.2)$$

as $n\to\infty$.
Theorem 3.3 (Asymptotic mixed normality of the PTHY estimator in finite activity case).

(a) Suppose [Af'](i)-(iii), [A2]-[A6] and [F] are satisfied. Suppose also 2[} = ~ 0, [N^] holds for some r e [8, c») 
and [T] holds with hn^"^ ^'p^ — > oo as n ^ oo. Then 

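$$b_n^{-1/4}\{PTHY(\mathsf{Z}^1,\mathsf{Z}^2)^n - [X^1,X^2]\}\xrightarrow{d_s}\int_0^{\cdot}w_s\,dW_s\quad\text{in }\mathbb{D}(\mathbb{R}_+) \qquad (3.3)$$

as $n\to\infty$, where $W$ is the same as in Theorem 2.2 and $w$ is given by (2.12).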

(b) Suppose [A1'], [A2]-[A6] and [F] are satisfied. Suppose also [N$_r$] holds for some $r\in[8,\infty)$ and [T] holds with
$b_n^{-(r-3)/2(r-2)}\rho_n\to\infty$ as $n\to\infty$. Then (3.3) holds, where $W$ is as above and $w$ is given by (2.13).

3.2 Infinite activity jump case 

Next we consider the case that the observed processes are two general semimartingales contaminated by noise.
We need the following structural assumption. Let $\beta\in[0,2]$.

[K$_\beta$] For each $l = 1,2$, we have

$$Z^l = X^l + \kappa(\delta^l)\star(\mu^l - \nu^l) + \kappa'(\delta^l)\star\mu^l,$$

where

(i) $X^l$ is a continuous semimartingale given by (2.1).

(ii) $\mu^l$ is a Poisson random measure on $\mathbb{R}_+\times E^l$ with intensity measure $\nu^l(dt,dx) = dt\,F^l(dx)$, where $(E^l,\mathcal{E}^l)$
is a Polish space and $F^l$ is a $\sigma$-finite measure on $(E^l,\mathcal{E}^l)$.

(iii) $\kappa(x) = x1_{\{|x|\le1\}}$ and $\kappa'(x) = x - \kappa(x)$ for each $x\in\mathbb{R}$.

(iv) $\delta^l$ is a predictable map from $\Omega^{(0)}\times\mathbb{R}_+\times E^l$ into $\mathbb{R}$. Moreover, there are a sequence $(R^l_k)$ of stopping
times increasing to $\infty$ and a sequence $(\psi^l_k)$ of non-negative measurable functions on $E^l$ such that

$$\sup_{\omega^{(0)}\in\Omega^{(0)},\,t<R^l_k(\omega^{(0)})}|\delta^l(\omega^{(0)},t,x)|\le\psi^l_k(x)\quad\text{and}\quad\int_{E^l}1\wedge\psi^l_k(x)^{\beta}\,F^l(dx)<\infty.$$

(v) If $\beta < 1$, for the process $f_t = \int_{E^l}\kappa(\delta^l(t,x))\,F^l(dx)$, there is a sequence $(\sigma_k)$ of $\mathbf{F}^{(0)}$-stopping times such
that for every $k$ we have a positive constant $C_k$ and $\lambda_k\in(0,3/4)$ satisfying (2.9) for every $t > 0$ and
any bounded $\mathbf{F}^{(0)}$-stopping time $\tau$.

Here and below $\star$ denotes the integral (either stochastic or ordinary) with respect to some (integer-valued)
random measure; see Chapter II of [27] for details. This type of assumption appears widely in the literature, for
example in [24]. [K$_\beta$] implies that for each $l = 1,2$ the generalized Blumenthal-Getoor index of $Z^l$ is less than $\beta$.

Theorem 3.4. Suppose [K$_\beta$] and [N$^\flat_r$] hold for some $\beta\in[0,2]$ and $r\in(2,\infty)$. Suppose also [C1]-[C2], [A4], [A6]
and [T] are satisfied. Then we have

sup \PTHY{Z\zy: PHYiWx'):] = o,{b]!') + O, (ib]!^p;A'^\ + o,{p]-P/') 

0<s<t \^ 'J 

as $n\to\infty$ for any $t > 0$.

Proof of this theorem is given in Appendix B. Combining this result with Theorem 2.1 or Theorem 2.2, we 
obtain the following results: 

Theorem 3.5 (Consistency of the PTHY estimator in infinite activity case). Suppose [K$_2$] and [N$^\flat_r$] hold for some
$r\in(2,\infty)$. Suppose also [C1]-[C2], [A4], [A6] and [T] are satisfied. Then we have (3.2) as $n\to\infty$.

Theorem 3.6 (Asymptotic mixed normality of the PTHY estimator in infinite activity case). 

(a) Suppose [Al'](i)-(iii) and [A2]-[A6] are satisfied. Suppose also [N^] holds for some r e [8,00), [K^] holds for 

some P E [0, 2 — 2£/-i ) '^'^'^ M holds with bn ^■'p„ — > 00 and p„ ~ 0{bl/^^^ as n —>■ 00. Moreover, 

suppose X} = X^ = 0. Then (3.3) holds true as n 00 with that W is the same one in Theorem 2.2 and w is 
given by (2.12). 

(b) Suppose [Al'] and [A2]-[A6] are satisfied. Suppose also [N^] holds for some r € [8,00), [K^] holds for some 
/3 G [0,2— 2^-1 ) '^'^'^ \^~\ holds with bn'"^ ■^)/^('' _s. qq and p„ ~ 0{bl/^^'^ as n ^ 00. Then (3.3) holds 
true as n —> 00 with that W is as in the above and w is given by (2.13). 

Note that the assumptions of Theorem 3.6 require at least (3 < 1. 
4 Some related topics for statistical application to finance



4.1 Estimation of the quadratic covariation of jump parts

As stated in the introduction, we are interested in the estimation of the quadratic covariation of the jump parts of
two semimartingales $Z^1$ and $Z^2$. This is achieved by estimating the quadratic covariation $[Z^1,Z^2]$, due to the formula
(1.2), because we can estimate the integrated covariance $\langle Z^{1,c},Z^{2,c}\rangle_t$ by the PTHY estimator as investigated in
the previous section. In the literature such estimators are usually given by consistent estimators for the integrated
covariance in the absence of jumps. See [33] and [37] for example. Following this approach, we consider the
pre-averaged HY estimator and we obtain the following result.

Proposition 4.1. Suppose [C1]-[C2], [A2], [A4], [A6], [K2] and [N^] are satisfied. Then 

PHY{Z\Z^)-^[Z\Z\ + 0,{b]/^) 

as n —> 00 for any t G K-|_ . 

See Appendix C for a proof. Consequently, we obtain the following result on the issue of the estimation of the 
quadratic covariation of jump parts: 

Corollary 4.1. Suppose [C1]-[C2], [A2], [A4], [A6], [K2] and [NJ,] for some r e (2, 00) are satisfied. Suppose also 
that [T] holds. Then 

PHY {Z^, Z^)"} ~ PTHY [Z^.Z^)"} ^ AZ^AZ^ 

0<s<t 

as n —> 00 for any t £ M+. Furthermore, if bn^^ 3)/2('' 2)^^ _^ ^ _ ^^^1/2(2 /3)-j ^ _^ then 

PHY {Z\Z^)^ - PTHY {Z\Z^yi ^ ^ AZ] AZf + 0,(&y4) 

0<,'5<t 

as n 00 for any t £ M_|_ . 



4.2 Autocorrelated noise 

We have so far assumed that the observation noise is not autocorrelated, conditionally on $\mathcal{F}^{(0)}$. In empirical
studies of financial high-frequency data, however, there is a lot of evidence that microstructure noise is
autocorrelated (see [20] and [43] for instance). In this subsection we briefly discuss the case that the observation noise is
autocorrelated conditionally on $\mathcal{F}^{(0)}$, as [15] did in the continuous case.

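We focus on the synchronous case. That is, we assume that $S^i = T^i$ for all $i$. Note that in this case it holds that
$\hat{S}^k = \hat{T}^k = R^k = S^k$. Let $(\lambda^l_u)_{u\in\mathbb{Z}_+}$ and $(\mu^l_u)_{u\in\mathbb{Z}_+}$ ($l = 1,2$) be four sequences of real numbers such that

$$\sum_{u=1}^{\infty}u|\lambda^l_u| < \infty\quad\text{and}\quad\sum_{u=1}^{\infty}u|\mu^l_u| < \infty. \qquad (4.1)$$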

We assume that the observation data (Z^^ ) and (Z^^ ) are of the form 



■11=0 u=0 



u=0 u=0 



(4.2) 



In other words, the observation noise follows a kind of linear processes. Under such a situation the consistency of 
our estimators is still valid: 

Proposition 4.2. Suppose (4.1) and (4.2) are satisfied. Suppose also [C1]-[C2], [A2], [A4], [A6], [K2] and [N^j 
are satisfied. Then PHY{Z^, Z^)" -^p [Z^,Z'^]t as n ^ 00 for any t e IR+. Furthermore, if [T] and [N^] for some 
T G (2,00) hold, then PTHY{Z^,Z^yi {X^ as n ^ 00 for any t G IR+. 

We give a proof of Proposition 4.2 in Appendix D. The proof is based on a Beveridge-Nelson type decomposition
for the noise. It might be possible to prove the asymptotic mixed normality of the PTHY estimator under the
above model by refining the proof of the above proposition given in the present article. On the other hand, in
the nonsynchronous case we will need to model the autocorrelation structure of the noise in terms of time dependence
in calendar time (as [43] did) rather than tick time (as in the above). This is because we have two axes of tick
time, $(S^i)$ and $(T^j)$, in the nonsynchronous case, and this fact complicates the analysis of our estimator. With an
appropriate modeling of the autocorrelation structure of the noise, we can probably obtain a Beveridge-Nelson type
decomposition for the noise even in the nonsynchronous case. As a result, we might be able to prove the consistency
and the asymptotic normality of our estimator. However, these topics are beyond the scope of this paper, so
we postpone them to further research.



4.3 Estimation of asymptotic variance

In this subsection we briefly discuss the estimation of the asymptotic variance of the PTHY estimator.
This is necessary to construct feasible confidence intervals based on this estimator, for example. We focus on the simple
case that the endogenous terms of the microstructure noise are absent, i.e. $\underline{X}^1 = \underline{X}^2 = 0$. In this case our aim can
be achieved by a kernel-based approach as in [23] and [30].

More precisely, let (ft.„) be a sequence of positive numbers tending to as ti — > oo. For any s G M-|_, put 



i^^^'Ys = (PTHY{Z\ Z'')^ - PTHY{Z', Z'')'A_ 



[X']'^ = [X\X^]'^ 



1,1' = 1,2 



and 



*^s = -^ E (z^.-z^._0(zU-40, ^'^^-/T S (z|, -z|,_0(z?^..i-z|,), 

" i:,5-h<S*+l<s " j:s-/l<TJ + l<,s 

"^^^Xs " ^7rr~ l(z§fc ~ Zgj._j)(Z-j._^j — z-^) + (Z-^^^ — 7.^^,){z~^ — z-^_^)| ^s^gh^fk^- 



fc:s-/l<i?,'" + l<s 



Then we set 



enb-'l[X^\^k[X^\^k + [[X\X^]'^k 



Ipfei Ipfc+i I 



+6-'nb-' + m'j,,^^,,. + 2[JM2]'^,^T5^^.} |r^ 



for every fc e N and wjAs = bn J2k R''<t '^'^ R'' fo'^ every t G 
Proposition 4.3. Suppose [Al'](i)-(iii) and [A2]-[A6] are satisfied. Suppose also [N^] holds for some r S [8,oo), 
[K/j] holds for some /? £ [0,2— 2S/^-i ^ '^""'^ I'^l ^"Zds with b, 
Moreover, suppose X} — X^ — 0. Then 



-(r-3)/2(r-2) 



p„ — >■ oo and pn = 0{b 



1/2(2-/3) 



) as n —>■ oo. 



w^ds 



w^ds 



as n oo, provided that h^^^bl/^ 



0. 



Proof. Fix a T > 0. First, by Lemma 10.2 of [31] we obtain ds = 6„ J2k R''<t ^ 
in t e [0,T]. Next, an argument similar to the proof of Proposition 4.2 of [30] yields 



-Op(l) uniformly 



bn J2 E[^\k\nik 



k:R>'<t 



--rny E \Q>^{{X^\rAX^\r^ + ([X\XY^.)'} G(1)S.| +rt6„ + (^^'.Xi^O'} 

-B-^li{{X^X^k^% + [x2]^.vl/^\ +2[X\A%.*^2.Xfi^-} |r1] +0,(1) 



k:R^<t 



uniformly in t G [0,T]. Then, by an argument similar to the proof of Lemma 10.3-10.4 of [31] we conclude that 



k:R^<t 



E 



/g wids + Op(l) uniformly in t e [0,T]. These results complete the proof of the 



proposition. 



□ 



The above approach has the disadvantage that it depends strongly on the particular form of the asymptotic variance
due to the noise. In fact, it is not adapted to the case in which the endogenous terms of the noise are present because
we have so far known no estimator for the statistic {\X} ,X'^]'gEl — [X^ , X^\'^F'^) ds which is the asymptotic 
variance due to the presence of the endogenous noise. We will also need to modify it if the noise is autocorrelated,
since we will need to replace the covariance matrix of the noise in the asymptotic variance with the long-run
covariance matrix of the noise (see the proof of Proposition 4.2 in Appendix D). To avoid this problem, we might
rely on the approach used in Section 4 of Christensen et al. [15] or the subsampling approach recently developed by
Kalnina [29], though verifying their theoretical validity remains a topic for further research.



5 Simulation study 



In this section, we examine the finite sample performance of our estimators by using Monte Carlo experiments. 



5.1 Choice of the threshold processes

As is well known, the thresholding method is often sensitive to the selection of thresholds in finite samples; see 
[41] or the Web Appendix of [33] for instance. Therefore, it is important to determine a reasonable rule of selecting 
thresholds. Here we present an easy but effective way to determine thresholds. Formal study of methods for optimal 
threshold selection in a given model is an important issue for the future. 

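We determine the thresholds for each process individually, so we focus on the univariate case. First we
compute an auxiliary estimator $\hat{\Sigma}^n_t$ for the spot variance process $\Sigma_t = \theta\psi_2[X^1]'_t + \theta^{-1}\psi_1\Psi^{11}_t$ for each sampling time $t$,
where $\psi_1 = \int_0^1 g'(s)^2\,ds$ and $\psi_2 = \int_0^1 g(s)^2\,ds$. In this paper we use a numerical derivative of the pre-averaged
bipower variation, i.e.

$$\hat{\Sigma}^n_{S^i} = \frac{\mu_1^{-2}}{K - 2k_n + 1}\sum_{p=i-K}^{i-2k_n} \left|\overline{\mathsf{Z}}^1(\mathcal{I})^p\right|\left|\overline{\mathsf{Z}}^1(\mathcal{I})^{p+k_n}\right|, \qquad i = K, K+1, \dots, N,$$

and $\hat{\Sigma}^n_{S^i} = \hat{\Sigma}^n_{S^K}$ if $i < K$. Here, $\mu_1$ is the first absolute moment of the standard normal distribution, $N$ is the number
of the available pre-averaged data $(\overline{\mathsf{Z}}^1(\mathcal{I})^i)$ and $K$ is a bandwidth parameter such that $K = O(b_n^{-\alpha})$ as $n \to \infty$ for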
some a G (0.5, 1). We will set K = [iV^/^] below. Such a kind of spot variance estimator was studied in Bos et al. 
[11]. Then we choose 

$$\varrho_n(S^i) = 2(\log N)^{1+\varepsilon}\,\hat{\Sigma}^n_{S^i}, \qquad i = 0, 1, \dots, N \tag{5.1}$$

for some $\varepsilon > 0$. We will set $\varepsilon = 0.2$ below.

The heuristic idea behind the above choice of thresholds is as follows. First we recall the following classic result: 

Theorem 5.1 (Pickands [36], Theorem 3.4). Let $(X_i)_{i\in\mathbb{N}}$ be a stationary Gaussian process such that $E[X_i] = 0$,
$E[X_i^2] = 1$ and $E[X_iX_{i+k}] = \gamma(k)$. If $\lim_{k\to\infty}\gamma(k) = 0$, then $\max_{1\le i\le n} X_i/\sqrt{2\log n} \to 1$, almost surely, as $n \to \infty$.

The most important point of the above theorem is that the random variables $X_i$, $i = 1, 2, \dots$, in the theorem can
have a kind of dependence structure. This fact is crucial for the present situation because the pre-averaged data
$(\overline{\mathsf{Z}}^1(\mathcal{I})^i)$ are $k_n$-dependent. As a result, Theorem 5.1 has the following implication. Suppose that the observation
data are given by a scaled Brownian motion with i.i.d. Gaussian noise. That is, suppose that $\mathsf{Z}_{S^i} = \sigma W_{S^i} + U_i$,
$i = 0, 1, \dots$, where $\sigma > 0$, $W_t$ is a standard Wiener process and $(U_i)$ is a sequence of i.i.d. random variables independent of
$W$ with $U_i \sim N(0, \omega^2)$. Suppose also that $(S^i)$ is an equidistant sampling scheme. Then the pre-averaged data
$(\overline{\mathsf{Z}}(\mathcal{I})^i)$ form a centered stationary Gaussian process with the autocovariance function vanishing at infinity, so that
Theorem 5.1 yields

$$\max_{0\le i\le N-1} \left|\overline{\mathsf{Z}}(\mathcal{I})^i\right| \Big/ \sqrt{\Sigma\cdot 2\log N} \to 1 \quad \text{a.s.}$$

as $n \to \infty$, where $\Sigma = \theta\psi_2\sigma^2 + \theta^{-1}\psi_1\omega^2$ is the variance of $\overline{\mathsf{Z}}(\mathcal{I})^i$. This result suggests that we may use $\Sigma\cdot 2\log N$
as thresholds. This idea has already been introduced as the universal threshold by [18] in the context of wavelet
shrinkage. [18] estimated the unknown parameter $\Sigma$ by the square of the median absolute deviation (MAD) of
$(\overline{\mathsf{Z}}(\mathcal{I})^i)$ divided by 0.6745, the 0.75-quantile of the standard normal distribution. In the present situation $(\overline{\mathsf{Z}}^1(\mathcal{I})^i)$
is heteroscedastic in general, hence we need to replace $\Sigma$ with the spot variance process $\Sigma_t$ and estimate $\Sigma_t$ by
$\hat{\Sigma}^n_t$. Consequently, we arrive at the threshold process given by (5.1), where we multiply the usual universal
threshold by $(\log N)^{\varepsilon}$ to ensure the condition (3.1).

Since the threshold process (5.1) can be regarded as a pre-averaged version of the local universal threshold
proposed in [30], we call (5.1) the pre-averaged local universal threshold (abbreviated PLUT). The author conjectures
that the PLUT satisfies the condition [T] under some mild regularity conditions, though we do not investigate
this topic in this paper and leave it for future work. Instead, we will show numerically below that the PLUT gives a
reasonable choice of the threshold processes.
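The construction of the PLUT can be sketched in code as follows. This is only an illustrative sketch, not the author's implementation: the function name `plut_thresholds`, the zero-based indexing, and the requirement that the caller supplies the bandwidth `K` are assumptions made here. The input `zbar` is assumed to already contain the pre-averaged data, and the spot variance is estimated by a local average of products of absolute pre-averaged values that are `k_n` indices apart.

```python
import numpy as np

def plut_thresholds(zbar, k_n, K, eps=0.2):
    """Sketch of a pre-averaged local universal threshold.

    zbar : 1-d array of pre-averaged data (hypothetical input layout)
    k_n  : pre-averaging window length
    K    : bandwidth of the local bipower average (supplied by the caller)
    eps  : exponent inflating the universal threshold by (log N)**eps
    """
    zbar = np.asarray(zbar, dtype=float)
    N = zbar.size
    if not (k_n >= 1 and 2 * k_n <= K < N):
        raise ValueError("need k_n >= 1 and 2*k_n <= K < N")
    mu1 = np.sqrt(2.0 / np.pi)                         # E|N(0,1)|
    # prod[p] = |zbar[p]| * |zbar[p + k_n]|
    prod = np.abs(zbar[:-k_n]) * np.abs(zbar[k_n:])
    csum = np.concatenate(([0.0], np.cumsum(prod)))    # csum[j] = sum(prod[:j])
    idx = np.arange(K, N)
    # sum of prod[p] over p = i-K, ..., i-2*k_n for each i >= K
    window_sum = csum[idx - 2 * k_n + 1] - csum[idx - K]
    sigma_hat = np.empty(N)
    sigma_hat[K:] = window_sum / (mu1 ** 2 * (K - 2 * k_n + 1))
    sigma_hat[:K] = sigma_hat[K]                       # extend to the first K indices
    thresholds = 2.0 * np.log(N) ** (1.0 + eps) * sigma_hat
    return thresholds, sigma_hat
```

A pre-averaged value whose square exceeds the corresponding threshold would then be treated as contaminated by a jump and dropped from the sum.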
5.2 Simulation design

We simulate over the interval $t \in [0,1]$. We normalize one second to be $1/23400$, so that the interval $[0,1]$
contains 6.5 hours. In generating the observation data, we discretize $[0,1]$ into $n = 23400$ intervals.

In order to extract irregular, nonsynchronous observation times from the $n$ equi-spaced division points, we generate
random observation times $(S^i)$ and $(T^j)$ using two independent Poisson processes with intensities $n/\lambda_1$ and $n/\lambda_2$.
Here $\lambda_l$ denotes the average waiting time for new data from process $Z^l$, so that a typical simulation will have $n/\lambda_l$
observations of $Z^l$, $l = 1, 2$. Following [5], we vary $\lambda := (\lambda_1, \lambda_2)$ through the configurations $(3,6)$, $(10,20)$
and $(30,60)$. Note that because we are simulating in discrete time, it is possible for the observation times
$(S^i)$ and $(T^j)$ to have points in common.
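One way to generate such sampling times is sketched below. The text does not say how arrival times are mapped to the grid, so snapping each arrival to the next grid point and discarding duplicates is an assumption of this sketch, as is the helper name `poisson_grid_times`.

```python
import numpy as np

def poisson_grid_times(n, lam, rng):
    """Grid indices in {0, ..., n} visited by a Poisson process on [0, 1].

    The process has intensity n / lam, i.e. a mean waiting time of lam grid
    steps. Arrivals are snapped to the next grid point (an assumption of this
    sketch), and coinciding arrivals are merged.
    """
    count = rng.poisson(n / lam)                 # number of arrivals on [0, 1]
    times = np.sort(rng.uniform(0.0, 1.0, count))
    return np.unique(np.ceil(times * n).astype(int))

rng = np.random.default_rng(12345)
n = 23400
S = poisson_grid_times(n, 3, rng)    # observation indices of the first process
T = poisson_grid_times(n, 6, rng)    # observation indices of the second process
common = np.intersect1d(S, T)        # shared grid points can occur in discrete time
```

Because several arrivals can fall into the same one-second cell, the number of distinct observation times in this sketch is somewhat smaller than `n / lam`.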

We consider three types of bivariate Lévy processes $J_t$ with no Brownian components to introduce jumps into the
models. The specifics of the jump processes are as follows:

NO $J = 0$, i.e. there are no jumps.

SCP1 Let $L$ be a stratified normal inverse Gaussian compound Poisson process with a single jump per unit time
(i.e., the jump time is uniformly distributed over $[0,1]$ and the jump size follows a normal inverse Gaussian
distribution). The jump size is drawn from $\epsilon\sqrt{S}$, where $\epsilon \perp\!\!\!\perp S$, $\epsilon \sim N(0,1)$ and $S \sim IG(c, c^2/\gamma)$, so that
$\mathrm{Var}[\epsilon\sqrt{S}] = E[S] = c$ and $\mathrm{Var}[S] = c^3/(c^2/\gamma) = c\gamma$. Then, we set $J^1 = J^2 = L$.

VG Let $L^1$ and $L^2$ be mutually independent variance Gamma processes such that $L^l_1 \sim \epsilon\sqrt{S}$, where $\epsilon \perp\!\!\!\perp S$, $\epsilon \sim N(0,1)$,
and $S \sim \Gamma(c/\gamma, 1/\gamma)$, so that $\mathrm{Var}[\epsilon\sqrt{S}] = E[S] = c$ for each $l = 1, 2$ and $\mathrm{Var}[S] = (c/\gamma)/(1/\gamma)^2 = c\gamma$.
Then, we set $J^1 = L^1$ and $J^2 = RL^1 + \sqrt{1 - R^2}\,L^2$.

In the simulation we set $c = 0.1$ and $\gamma = 0.25$. The value of $R$ is given for each model below. Note that each
component of the above models coincides with the model simulated in Veraart [44].
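The two mixing distributions above can be sampled directly with NumPy, as the following sketch shows. The function names are hypothetical. It assumes that NumPy's `wald(mean, scale)` corresponds to $IG(c, c^2/\gamma)$ with `scale` as the shape parameter, and that $\Gamma(c/\gamma, 1/\gamma)$ is parametrized by shape and rate, so that NumPy's `scale` equals $\gamma$. Both choices reproduce the moments stated in the text.

```python
import numpy as np

C, GAMMA = 0.1, 0.25   # values of c and gamma used in the simulation

def nig_jump_sizes(rng, size):
    """Jump sizes eps * sqrt(S) with S ~ IG(c, c^2 / gamma)."""
    S = rng.wald(mean=C, scale=C ** 2 / GAMMA, size=size)
    return rng.standard_normal(size) * np.sqrt(S)

def vg_unit_increments(rng, size):
    """Unit-time variance Gamma increments eps * sqrt(S), S ~ Gamma(c/gamma, rate 1/gamma)."""
    S = rng.gamma(shape=C / GAMMA, scale=GAMMA, size=size)
    return rng.standard_normal(size) * np.sqrt(S)

def correlate_jumps(l1, l2, R):
    """Second jump component R * L1 + sqrt(1 - R^2) * L2 of the VG design."""
    return R * l1 + np.sqrt(1.0 - R ** 2) * l2
```

In both cases the sample variance of the draws should be close to $c = 0.1$, which gives a quick sanity check of the parametrization.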
The observation data $(\mathsf{Z}^1_{S^i})$ and $(\mathsf{Z}^2_{T^j})$ are generated from the models below.

Model 1 (Barndorff-Nielsen et al. [5]): the case of stochastic volatility & additive noise. The following
bivariate factor stochastic volatility model is used to generate the continuous semimartingales $X^1$ and $X^2$:

$$dX^l_t = \mu^l\,dt + \rho^l\sigma^l_t\,dB^l_t + \sqrt{1-(\rho^l)^2}\,\sigma^l_t\,dW_t, \quad \sigma^l_t = \exp(\beta^l_0 + \beta^l_1\varrho^l_t), \quad d\varrho^l_t = \alpha^l\varrho^l_t\,dt + dB^l_t, \quad l = 1, 2,$$

where $(B^1, B^2, W)$ is a 3-dimensional standard Wiener process. The initial values for the $\varrho^l_t$ processes at each
simulation run are drawn randomly from their stationary distribution, which is $\varrho^l_t \sim N(0, (-2\alpha^l)^{-1})$. We carry out
our numerical experiments using the following parametrization, assumed to be identical across the two volatility
factors: $(\mu^l, \beta^l_0, \beta^l_1, \alpha^l, \rho^l) = (0.03, -5/16, 1/8, -1/40, -0.3)$, so that $\beta^l_0 = (\beta^l_1)^2/(2\alpha^l)$. This choice of parameters
implies that integrated volatility has been normalized, in the sense that $E[\int_0^1 (\sigma^l_s)^2\,ds] = 1$. At each simulation run
we add noise {U\,,)^^q simulated as 



'^'^-El^vJ ' ^^^d Corr(C/i,C/,2) = | 



R if t = s,
0 if t ≠ s,



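where $R = \sqrt{1-(\rho^1)^2}\sqrt{1-(\rho^2)^2}$ and the noise-to-signal ratio $\eta^2$ takes the value 0.001. Finally, $\mathsf{Z}^1_{S^i}$ and $\mathsf{Z}^2_{T^j}$ are
given by $\mathsf{Z}^1_{S^i} = Z^1_{S^i} + U^1_{S^i}$ and $\mathsf{Z}^2_{T^j} = Z^2_{T^j} + U^2_{T^j}$, where $Z^l = X^l + J^l$, $l = 1, 2$.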
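The latent part of Model 1 can be simulated with a plain Euler-Maruyama recursion, sketched below for a single component. The function name and argument layout are hypothetical. The common Brownian increments `dW` are passed in so that both components can share the same $W$. Noise, jumps and random sampling are deliberately left out.

```python
import numpy as np

MU, BETA0, BETA1, ALPHA, RHO = 0.03, -5 / 16, 1 / 8, -1 / 40, -0.3

def simulate_model1_component(n, dW, rng):
    """Euler-Maruyama path of one latent log-price X^l on a grid of n steps.

    dW  : increments of the common Wiener process W (length n, variance 1/n)
    rng : generator used for the component-specific Wiener process B^l
    """
    dt = 1.0 / n
    dB = rng.standard_normal(n) * np.sqrt(dt)
    q = np.empty(n + 1)                       # volatility factor
    x = np.zeros(n + 1)                       # latent log-price, started at 0
    # initial value drawn from the stationary law N(0, 1 / (-2 * alpha))
    q[0] = rng.standard_normal() * np.sqrt(1.0 / (-2.0 * ALPHA))
    for i in range(n):
        sigma = np.exp(BETA0 + BETA1 * q[i])
        x[i + 1] = (x[i] + MU * dt + RHO * sigma * dB[i]
                    + np.sqrt(1.0 - RHO ** 2) * sigma * dW[i])
        q[i + 1] = q[i] + ALPHA * q[i] * dt + dB[i]
    return x, q

rng = np.random.default_rng(2024)
n = 23400
dW = rng.standard_normal(n) * np.sqrt(1.0 / n)
x1, q1 = simulate_model1_component(n, dW, rng)
x2, q2 = simulate_model1_component(n, dW, rng)   # same W, independent B^2
```

With the stated parameters, $E[(\sigma^l_t)^2] = \exp(2\beta_0 + 2\beta_1^2/(-2\alpha)) = 1$ under the stationary law, which is the normalization mentioned in the text.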

Model 2 (Jacod et al. [25]): the case of constant volatility & rounding plus error.



Z\=X\+a'jl X\=Xl + a'W„ 



A/n - log 



cxp(^i./„ 



k/nl 



'/i/n loj 



exp(4/J ' 



where $W^1$ and $W^2$ are correlated standard Wiener processes independent of $J$ with $d[W^1,W^2]_t = R\,dt$ and $(\eta^l_{i/n})$
is a sequence of independent Bernoulli variables (with probabilities $p^l_{i/n}$ and $1 - p^l_{i/n}$ of taking values 1 and 0), with

P\/n log ( °l!lfz/"\ ) / log I " Lxp(z'. ^ I • Wc assume that the sequences (?7L„) and (77^ , ) arc mutually 



independent as well as independent of $W$ and $J$. Parameters used: $\sigma = 0.2/\sqrt{252}$, $X_0 = \log(8)$, $\alpha = 0.01$ and
$R = 0.5$.

Model 3: the case of stochastic volatility & endogenous noise. The model of the continuous semimartingales
$X^1$ and $X^2$ is the same as in Model 1, but the noise processes $U^1_{S^i}$ and $U^2_{T^j}$ are given by

Here we set $\delta^1 = \delta^2 = -0.01$, so that the microstructure noise is negatively correlated with the returns of the latent
continuous semimartingale processes $X^1$ and $X^2$. This choice reflects the empirical findings reported in Hansen
and Lunde [20]. Note that the magnitude of the noise processes in this model is smaller than the one in Model 1.
Finally, as in Model 1 we set $\mathsf{Z}^1_{S^i} = Z^1_{S^i} + U^1_{S^i}$ and $\mathsf{Z}^2_{T^j} = Z^2_{T^j} + U^2_{T^j}$, where $Z^l = X^l + J^l$, $l = 1, 2$.

1000 iterations were run for each model. The model paths of $X^1$ and $X^2$ were simulated using
the Euler-Maruyama scheme with $n$ equi-spaced division points.

The estimators are calculated componentwise. That is, we use all of the observations for the variance estimators
and the refresh time sampling of the observations for the covariance estimator. The tuning parameters for
pre-averaging are selected as follows. We use $\theta = 0.15$ and $g(x) = x \wedge (1 - x)$ following Christensen et al. [15], and
set $k_n = \lceil\theta\sqrt{m}\rceil$. Here, $m$ represents the number of observed returns when calculating the variance estimators,
and the number of refresh times minus 1 when calculating the covariance estimators.
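As an illustration of these tuning choices, the sketch below computes pre-averaged data from a vector of observations with the weight function $g(x) = x \wedge (1 - x)$ and window length $k_n = \lceil\theta\sqrt{m}\rceil$. The function name, the zero-based indexing of the windows and the use of the ceiling are assumptions of this sketch. No normalization of the pre-averaged values is applied.

```python
import math
import numpy as np

def g(x):
    """Weight function g(x) = min(x, 1 - x) on [0, 1]."""
    return np.minimum(x, 1.0 - x)

def pre_average(z, theta=0.15):
    """Pre-averaged data zbar[i] = sum_{p=1}^{k_n-1} g(p/k_n) * (z[i+p] - z[i+p-1]).

    z     : observations (length m + 1, giving m returns)
    theta : tuning constant, k_n = ceil(theta * sqrt(m))
    """
    z = np.asarray(z, dtype=float)
    m = z.size - 1
    k_n = math.ceil(theta * math.sqrt(m))
    if k_n < 2:
        raise ValueError("too few observations for pre-averaging")
    w = g(np.arange(1, k_n) / k_n)            # weights g(p / k_n), p = 1, ..., k_n - 1
    dz = np.diff(z)
    # np.correlate in 'valid' mode gives sum_j dz[i + j] * w[j] for each window start i
    zbar = np.correlate(dz, w, mode="valid")
    return zbar, k_n
```

For the covariance estimator, the same function would be applied to each series after restricting it to the refresh times.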

5.3 Simulation results 

In Tables 1-3, we present the bias and the root mean squared error of our PTHY estimator in Models 1-3
respectively. For comparison, we also computed the subsampled realized bipower (co)variations (BPVs) based on
5-minute returns. More precisely, we computed a total of 300 realized bipower (co)variations by shifting the time
of the first observation in 1-second increments. Then, we took the average of these estimators. Such an operation is
commonly used in empirical work in this area; see [4] and [34] for example. Note that we divide the reported numbers
in Table 2 by $(0.2/\sqrt{252})^2$ for normalization. We can see that our estimator performs well even in finite samples,
especially when the frequency of the observations is relatively large. When the frequency of the observations is
small, it is downward biased due to the loss of summands induced by pre-averaging and the nonsynchronicity of
the observation times. This is not surprising, because reducing the effects induced by microstructure noise forces the
estimation to be less efficient. In contrast, the variance estimation by the BPV statistic is significantly upward
biased across all scenarios for Models 1-2, while it is downward biased across almost all scenarios for Model 3. Although
for Models 1-2 the covariance estimator of the BPV is surprisingly more precise than the variance estimator, our
PTHY estimator is superior to it in most of the scenarios.
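The subsampling scheme for the benchmark can be sketched as follows for the univariate case. The sketch assumes that the price has already been placed on a regular 1-second grid (for example by previous-tick interpolation, which is not specified in the text) and applies no edge correction for the shifted grids. The function name is hypothetical.

```python
import numpy as np

def subsampled_bpv(z, step=300, n_shifts=300):
    """Average of realized bipower variations over shifted sampling grids.

    z        : prices on a regular 1-second grid
    step     : sampling interval in seconds (300 = 5 minutes)
    n_shifts : number of 1-second shifts of the first observation
    """
    z = np.asarray(z, dtype=float)
    values = []
    for s in range(n_shifts):
        r = np.diff(z[s::step])                       # 5-minute returns on the shifted grid
        values.append((np.pi / 2.0) * np.sum(np.abs(r[:-1]) * np.abs(r[1:])))
    return float(np.mean(values))
```

The bipower covariation used for the off-diagonal target is not covered by this sketch.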

In Tables 4-6, we present the bias and the root mean squared error of estimators for the quadratic (co)variations
$JV^{k,l} := [Z^k,Z^l]_1 - [X^k,X^l]_1$ ($k, l = 1, 2$) of the jump processes (JV) in Models 1-3 respectively. For comparison, we also report
the results for the estimators based on the differences between the subsampled realized (co)variances (RV) based
on 5-minute returns and the BPVs. The reported numbers in Table 5 are divided by $(0.2/\sqrt{252})^2$
for normalization as above. As the tables reveal, our estimator is downward biased in the presence of jumps when
the frequency of the observations is large, and this downward bias tends to be large in the VG case. This is not
surprising because the thresholding technique cannot detect very small jumps whose sizes have the same magnitude
as those of Brownian increments (note that in finite samples we cannot identify very small jumps in principle;
see Zhang [46] for details). Although the bias is modest compared with that of the BPV based statistic (when
the sampling frequency is sufficiently large), we might need to investigate the possibility of making finite sample
adjustments. We also find that both of the estimators are upward biased in the absence of jumps. This is theoretically
natural because these estimators should be non-negative asymptotically (note that in the simulation we always have
$JV^{1,2} \ge 0$). In particular, our estimator is far more precise than the BPV based statistic if the sampling frequency
is large. However, our estimator does not perform well when the frequency of the observations is very small. In
this case our estimator is inferior to the BPV based statistic in some situations. It is also worth mentioning that in
terms of the bias our estimator has slightly worse performance in Model 3 than in Model 1, despite the fact that the
magnitude of the noise in Model 3 is smaller than that in Model 1.

Finally, we briefly compare our simulation results with some existing empirical studies in this area. Recently
Christensen et al. [14] indicated that low-frequency based measures of jump variations such as the above BPV based
statistic tend to be upward biased. In our simulation, the BPV based estimators for the quadratic variations of
the jump processes are upward biased in the absence of jumps. They are also upward biased across all scenarios in
Model 2. These findings have the following implications. First, in real markets jumps might not occur often (or they
might be too active to be disentangled from diffusive components as indicated in [1]) and perhaps their magnitude
is not so large. Second, the round-off effects could be important for measuring jump variations. This is
of course quite natural intuitively. On the other hand, in Table 3 of [14] we can find that the reported values of their
low-frequency based bipower variations are smaller than those of their tick frequency based estimators for almost all
assets, so that we expect the low-frequency based bipower variations to be downward biased. We can observe such
phenomena (only) across all of the scenarios in Model 3. This is not surprising because the downward bias of the
BPV based estimators for integrated variances presumably appears only in the presence of negative correlation
between the noise and the efficient returns.
Table 1: Simulation results of the estimation of integrated (co)variances in Model 1

                             PTHY                                       BPV
Target          [X^1]_1     [X^1,X^2]_1   [X^2]_1       [X^1]_1     [X^1,X^2]_1   [X^2]_1

NO
λ = (3, 6)     -.003 (.136)  -.002 (.104)  -.004 (.241)   .132 (.291)  -.020 (.136)   .131 (.379)
λ = (10, 20)   -.005 (.196)  -.011 (.135)  -.022 (.300)   .131 (.291)  -.058 (.165)   .122 (.399)
λ = (30, 60)   -.028 (.256)  -.037 (.200)  -.054 (.425)   .120 (.291)  -.142 (.246)   .092 (.412)

SCP1
λ = (3, 6)      .004 (.137)   .003 (.104)   .002 (.237)   .164 (.318)   .012 (.148)   .164 (.399)
λ = (10, 20)    .005 (.198)  -.006 (.137)  -.014 (.298)   .163 (.315)  -.028 (.166)   .155 (.418)
λ = (30, 60)   -.018 (.259)  -.033 (.203)  -.044 (.421)   .151 (.315)  -.116 (.236)   .123 (.428)

VG
λ = (3, 6)      .005 (.136)   .005 (.103)   .011 (.239)   .171 (.328)   .017 (.150)   .177 (.412)
λ = (10, 20)    .006 (.200)  -.003 (.137)  -.002 (.303)   .170 (.326)  -.022 (.165)   .168 (.430)
λ = (30, 60)   -.014 (.256)  -.030 (.200)  -.033 (.427)   .158 (.322)  -.111 (.235)   .136 (.438)

Note. We report the bias and rmse of the estimators for the integrated (co)variances included in the simulation study. The number
reported in parentheses is the rmse.



Table 2: Simulation results of the estimation of integrated (co)variances in Model 2

                             PTHY                                       BPV
Target          [X^1]_1     [X^1,X^2]_1   [X^2]_1       [X^1]_1     [X^1,X^2]_1   [X^2]_1

NO
λ = (3, 6)     -.000 (.090)   .003 (.084)   .002 (.010)   .111 (.176)  -.021 (.109)   .110 (.177)
λ = (10, 20)    .001 (.118)  -.007 (.114)  -.011 (.142)   .109 (.115)  -.043 (.119)   .106 (.181)
λ = (30, 60)   -.021 (.159)  -.036 (.161)  -.057 (.186)   .101 (.180)  -.105 (.156)   .073 (.181)

SCP1
λ = (3, 6)      .008 (.089)   .011 (.085)   .010 (.103)   .150 (.214)   .013 (.122)   .149 (.213)
λ = (10, 20)    .014 (.118)   .005 (.115)   .001 (.144)   .148 (.215)  -.011 (.124)   .145 (.218)
λ = (30, 60)   -.006 (.160)  -.025 (.161)  -.045 (.188)   .140 (.218)  -.076 (.145)   .107 (.205)

VG
λ = (3, 6)      .010 (.090)   .002 (.086)   .016 (.109)   .154 (.222)   .000 (.118)   .158 (.220)
λ = (10, 20)    .004 (.118)  -.003 (.118)   .008 (.118)   .152 (.224)  -.019 (.121)   .153 (.222)
λ = (30, 60)   -.010 (.159)  -.038 (.159)  -.036 (.192)   .143 (.226)  -.086 (.152)   .119 (.212)

Note. We report the bias and rmse of the estimators for the integrated (co)variances included in the simulation study. The number
reported in parentheses is the rmse. All of the reported numbers are divided by $(0.2/\sqrt{252})^2$.
Table 3: Simulation results of the estimation of integrated (co)variances in Model 3

                             PTHY                                       BPV
Target          [X^1]_1     [X^1,X^2]_1   [X^2]_1       [X^1]_1     [X^1,X^2]_1   [X^2]_1

NO
A = 


(3,6) 




~ 009 { lOO^i 




— 097 ( 99Q\ 




— 04^ ( '^JiW 


λ = (10, 20)   -.008 (.186)  -.017 (.138)  -.033 (.305)  -.042 (.236)  -.066 (.171)  -.073 (.401)
λ = (30, 60)   -.039 (.248)  -.055 (.198)  -.079 (.433)  -.073 (.254)  -.155 (.259)  -.144 (.460)

SCP1
λ = (3, 6)     -.000 (.127)   .000 (.101)  -.004 (.215)  -.010 (.229)  -.016 (.149)  -.026 (.370)
λ = (10, 20)   -.003 (.186)  -.014 (.138)  -.028 (.300)  -.012 (.237)  -.037 (.170)  -.042 (.399)
λ = (30, 60)   -.035 (.246)  -.053 (.199)  -.074 (.428)  -.056 (.251)  -.141 (.253)  -.127 (.455)

VG
λ = (3, 6)      .002 (.127)   .002 (.100)   .003 (.215)   .010 (.243)   .003 (.156)   .000 (.375)
λ = (10, 20)   -.001 (.186)  -.013 (.139)  -.018 (.303)  -.005 (.246)  -.031 (.171)  -.028 (.399)
λ = (30, 60)   -.028 (.246)  -.051 (.196)  -.062 (.435)  -.037 (.255)  -.013 (.247)  -.102 (.452)

Note. We report the bias and rmse of the estimators for the integrated (co)variances included in the simulation study. The number
reported in parentheses is the rmse.



Table 4: Simulation results of the estimation of jump (co)variations in Model 1

                             PTHY                                       BPV
Target          JV^{1,1}      JV^{1,2}      JV^{2,2}      JV^{1,1}      JV^{1,2}      JV^{2,2}

NO
λ = (3, 6)      .000 (.005)   .001 (.003)   .001 (.005)   .011 (.075)   .016 (.049)   .011 (.093)
λ = (10, 20)    .003 (.018)   .006 (.020)   .009 (.047)   .013 (.074)   .014 (.055)   .016 (.107)
λ = (30, 60)    .018 (.071)   .025 (.066)   .030 (.113)   .020 (.097)   .023 (.063)   .044 (.132)

SCP1
λ = (3, 6)     -.006 (.032)  -.003 (.037)  -.004 (.041)  -.021 (.098)  -.022 (.077)  -.021 (.111)
λ = (10, 20)   -.006 (.050)   .002 (.055)   .002 (.071)  -.018 (.098)  -.020 (.084)  -.016 (.122)
λ = (30, 60)    .008 (.097)   .021 (.089)   .021 (.125)  -.012 (.113)  -.016 (.096)   .013 (.135)

VG
λ = (3, 6)     -.008 (.039)  -.007 (.041)  -.017 (.046)  -.029 (.116)  -.030 (.097)  -.038 (.129)
λ = (10, 20)   -.007 (.053)  -.003 (.053)  -.013 (.070)  -.026 (.112)  -.028 (.100)  -.034 (.138)
λ = (30, 60)    .003 (.098)   .014 (.092)   .004 (.125)  -.020 (.124)  -.026 (.121)  -.005 (.140)

Note. We report the bias and rmse of the estimators for the jump (co)variations included in the simulation study. The number
reported in parentheses is the rmse.
Table 5: Simulation results of the estimation of jump (co)variations in Model 2

                             PTHY                                       BPV
Target          JV^{1,1}      JV^{1,2}      JV^{2,2}      JV^{1,1}      JV^{1,2}      JV^{2,2}

NO
λ = (3, 6)      .000 (.002)   .001 (.003)   .001 (.005)   .134 (.141)   .010 (.038)   .137 (.145)
λ = (10, 20)    .003 (.011)   .006 (.015)   .009 (.023)   .136 (.144)   .013 (.040)   .141 (.152)
λ = (30, 60)    .017 (.039)   .035 (.059)   .052 (.087)   .143 (.155)   .018 (.049)   .169 (.186)

SCP1
λ = (3, 6)     -.007 (.032)  -.009 (.037)  -.008 (.040)   .095 (.127)  -.025 (.082)   .099 (.136)
λ = (10, 20)   -.009 (.047)  -.005 (.054)  -.001 (.060)   .098 (.130)  -.024 (.087)   .103 (.141)
λ = (30, 60)    .004 (.065)   .023 (.086)   .038 (.107)   .104 (.139)  -.022 (.104)   .133 (.173)

VG
λ = (3, 6)     -.012 (.036)  -.009 (.034)  -.019 (.045)   .082 (.135)  -.021 (.071)   .083 (.122)
λ = (10, 20)   -.012 (.052)  -.004 (.046)  -.015 (.061)   .084 (.137)  -.021 (.072)   .085 (.127)
λ = (30, 60)   -.002 (.078)   .024 (.085)   .025 (.108)   .089 (.146)  -.013 (.084)   .118 (.163)

Note. We report the bias and rmse of the estimators for the jump (co)variations included in the simulation study. The number
reported in parentheses is the rmse. All of the reported numbers are divided by $(0.2/\sqrt{252})^2$.



Table 6: Simulation results of the estimation of jump (co)variations in Model 3

                             PTHY                                       BPV
Target          JV^{1,1}      JV^{1,2}      JV^{2,2}      JV^{1,1}      JV^{1,2}      JV^{2,2}

NO
λ = (3, 6)      .001 (.007)   .002 (.006)   .003 (.016)   .013 (.087)   .009 (.051)   .012 (.107)
λ = (10, 20)    .007 (.032)   .011 (.032)   .017 (.060)   .014 (.091)   .009 (.052)   .017 (.108)
λ = (30, 60)    .030 (.089)   .039 (.081)   .055 (.153)   .022 (.094)   .015 (.054)   .053 (.138)

SCP1
λ = (3, 6)     -.003 (.022)  -.001 (.027)   .000 (.035)  -.004 (.093)  -.010 (.065)  -.006 (.112)
λ = (10, 20)    .001 (.052)   .009 (.059)   .013 (.078)  -.016 (.110)  -.024 (.083)  -.014 (.120)
λ = (30, 60)    .025 (.098)   .038 (.093)   .051 (.157)   .006 (.099)  -.005 (.069)   .037 (.135)

VG
λ = (3, 6)     -.005 (.038)  -.004 (.041)  -.010 (.045)  -.026 (.123)  -.032 (.098)  -.037 (.138)
λ = (10, 20)   -.000 (.055)   .005 (.058)   .001 (.073)  -.025 (.124)  -.033 (.100)  -.031 (.136)
λ = (30, 60)    .018 (.109)   .003 (.103)   .034 (.151)  -.015 (.122)  -.033 (.118)   .006 (.137)

Note. We report the bias and rmse of the estimators for the jump (co)variations included in the simulation study. The number
reported in parentheses is the rmse.
Appendix



A Proof of Theorem 3.1 

First note that for the proof we can use a locahzation procedure, and which allows us to systematically replace 
the conditions [CI], [C2], [N^] and [F] by the following strengthened version: 

[SCI] There is a positive constant K such that hnN^^ < K for all n and t. 

[SC2] [C2] holds, and [A^)', (A^)', (A^)', (A^)' and [V,W]' for each V,W = X\X^,X\X^ are bounded. 
[SN^] (J \z\^Qt{dz))tsz]^^ is a bounded process. 
[SF] We have [F] and there is a positive constant B such that 

i3-i< inf |7l|<sup|7[|<B (A.l) 

fcGN fcgN 

for each Z = 1, 2. 

Next we introduce the following strengthened version of the condition [T] : 

[ST] For each / = 1, 2 we have gn{t) = al^{t)pn, where (pn)neN is the same one in [T] and {al^{t))neN is a sequence 
of (not necessarily adapted) positive- valued stochastic processes such that there exists a positive constant Kq 
satisfying 

< inf a(,«) (t) < sup al?^ (t) < Ko, 7i = 1, 2, . . . . 
Lemma A.l. Let (c„) be a sequence of positive numbers. If we have 

c-^{pfHY{Z\Z^y' ~PHY{X\X^y'} ^0 as n ^ oo (A.2) 
under the condition [ST], then we have also (A.2) under the condition [T]. 

Proof. Let t > and k E N. Suppose that [T] holds. Then, for an arbitrary e > 0, there exists a positive number 
K such that 

supP sup al^{s)>K\<e and supP sup [1 / al-^{s)] > K ] < e , 1 = 1,2. 

neN \0<s<R'^ J n6N \0<s<fl^ J 

Hence for any 77 > we have 

^ <k]. 



P (*"(0 > 7?) < PiRl ARl<t)+4:e + Pl *"(* ARIa RI) > t?, ^max^ 



I r ^ 1 

sup a„(s) V sup -J— - 

0<s<_R'. 0<s<Bi O^nl^y 



where 4'"(t) supo<,<t c"! |PTi7r(Z\ Z^)^ - PHY{X\X'^)'^\. Therefore, by the assumption we obtain 

limsupP(1'"(i) > 77) < P{RIaRI < + 4£. 

Since e is arbitrary, we can replace e in the above inequality with 0. Finally, with k tending to 0, we obtain the 
desired result. □ 

We need a modification of sampling times as follows. We write r„ = 6| . Next, let u„ = inf{t[r„(i) > f„}, and 
define a sequence {S^)i^z+ sequentially by = S" if 5* < u„, otherwise 5* = 5*^^ + r„. Then, (5*) is obviously a 
sequence of F'^^-stopping times satisfying (2.2) and supjgjs[(S'* — S*'"^) < f„. Furthermore, for any i > we have 
PiCliiS' At S' At}) < P{vn < <) ^ as n ^ 00 by (2.3). By replacing {S') with (T-?), we can construct a 
sequence (T^) in a similar manner. This argument implies that we may also assume that 

sup r„(i) < f„ (A.3) 

tGR+ 

by an appropriate localization procedure. 

Finally, we note that the inequality (3.2) of [31] holds true: 

00 00 

^i^'=^<2fc„ + l and ^/r'=<2fc„ + l, fc = 0, 1,.... (A.4) 

j=0 i=0 
Now we introduce some notation and prove some lemmas which we will also use later. Set $g^n_p = g(p/k_n)$ and
$\Delta(g)^n_p = g^n_{p+1} - g^n_p$ for every $n, p$. For any semimartingale $V$ and any (random) interval $I$, we define the processes
$V(I)_t$ and $I_t$ by $V(I)_t = \int_0^t 1_I(s-)\,dV_s$ and $I_t = 1_I(t)$ respectively. Moreover, set

k„-l fe„-l 

p=l 9=1 

and 

fc„-i fe„-i 

for each t £ and i,j E Z^. The following lemma is an analog to Lemma 3.1 of [30] and was used in [31]: 

Lemma A. 2. Suppose A', [X''], and [X'] are absolutely continuous with locally bounded derivatives for I = 1,2. 
Suppose also (A. 3) holds. Then a.s. we have 

limsupsup^ESL < II.9II00 sup |[Ai];|, limsupsup^EHL <L sup \[X}]'s\, (A.5) 
«->oo «eN . /2fc„f„log J- o<''<* «6N /2fc„f„logJ- o<''<* 



limsupsup ' ^ : < llgjloo sup |[A^];|, limsupsup ^— ^ '^^ < L sup \[XX\ (A.6) 

n^co rem hknfn log ^ Afc^f^ log ^ 



for any t > 0, where L is a positive constant which only depends on g. 

Proof. Combining a representation of a continuous local martingale with Brownian motion and Levy's theorem on 
the uniform modulus of continuity of Brownian motion, we obtain 

limsup sup < sup |[Ai];|, limsup sup ~— < sup [[A^J, 

5^+0 s,uelQ,t] . 25 log ^ o^''^* -5-^+0 s."6[o,t] ,/251ogi 0^^^* 

\s — u\<S V " \s — u\<S V ° 

where X' = -(ET^J'-) ' X}- Since AHX)j = - E'=o' A(g)^(A|,^^^^ - Xl^J and |A(g)^| < ^[HU, we 
obtain the first inequality in (A.5). On the other hand, since Abel's partial summation formula yields A^(I)J = 
bn'^' E'=o'{A(5)^i - Aig);}iXl^^^^ - Xh-.J^ and Aig);^, - A{g); = - + 1/^™) " 5'(^)}dx, 

the piecewise Lipschitz continuity of 5' and (2.5) imply the second inequality in (A.S). By symmetry we also obtain 
(A.6). ' □ 

We can strengthen Lemma A.2 by a localization if we assume that (A.3) and [SC2] hold, so that in the remainder
of this section we always assume that we have a positive constant $K$ and a positive integer $n_0$ such that

|Ai(I)j(a.)| + |Ai^(I)Kc.)| , |^,^(^)^(^)| + \X\iJ)liu^)\ 

sup — — — h sup — — — < K (A. 7) 

ten ^2fc„r„| log6„| jen ^2fc„r„| log fe„| 

for all $t > 0$ and $\omega \in \Omega$ if $n \ge n_0$. Moreover, we only consider sufficiently large $n$ such that $n \ge n_0$.
Next, set 

fe„-i fc„-i 

for each i, j G Z_|_. We denote by Eq a conditional expectation given J^^^\ i.e. Eq[-] := £'[-|J^(°^]. 

Lemma A.S. Suppose [SN^] hold for some r G [2,oo). Then there exists a some positive constant Kr independent 
of n such that 

EoWtiiyn < Krk-^'\ E,[\Q\jy\^] < Krk-^/' (A.S) 

for all i,j e N. 
Proof. The Burkholder-Davis-Gundy inequality, Jensen's inequality and the Lipschitz continuity of g yield



r/2' 



p=0 



fc„-l 



p=0 



hence we obtain the first inequality of (A. 8). By symmetry we also obtain the second one. 
Set r = [S\ J3 = [f^ rJ+'=") and R^{i,i) = S''+^^ V f^+'^" for each i,j e Z+. 



□ 



Lemma A. 4. Let c be a positive number. Suppose [SC2] and [SF] hold. Suppose also [SN^] holds for some 
r S (2, oo). Then for all t > we have 

oo oo 

J2 P (if (^)'l > c, N\r)t ^ 0, 3'+''- < t) ^ 0, ^ P (if > c, ^'(J^)t ^ 0, < i) ^ (A.9) 



j=i 
as n —> oo . 



Proof. Lemma A.3 yields i?o[|C (2^)*ri{A'i(/»)f5^o} 
we obtain 



< fcn '"^^l{Ari(/i)j^o} uniformly in i. Since A'^"'^ is a point process, 



and thus we obtain the first equation of (A.9) since $r > 2$ and $k_n \to \infty$ as $n \to \infty$. By symmetry we also obtain
the second equation of (A.9), which completes the proof of the lemma. □

Proof of Theorem 3.1. By a localization procedure, we may replace the conditions [F], [C1]-[C2] and [NJ^] with 
[SF], [C1]-[C2] and [SN^] respectively. Moreover, we can also replace the condition [T] with [ST] by Lemma A.l, 
while (2.3) can be replaced with (A. 3) due to the above argument. 
We decompose the target quantity as 



{|z'(i)-p>e^b:]}u{|z'(JM|2>e2[j]} 



l,j:W'{i,j}<t 

=;!( +nf +IIIf +IVt, 

where D[ :— J2k=i ^1 ^'^^ each I = 1,2. 

First consider I. By the Schwarz inequality and (A. 4), we have 

r 1/2 / 



{\z'(xy\^<eU^,\z^{jy\^<gm} 



1/2 



sup jl.l <— — 

0<s<t {WHYK„y 



J2 |x\x)fA-n 

i,r.R^(i,j)<t 



(,i:S' + '=n<t 



^ 1/2 / 



J2 \y(-{jyfK'n 



1/2 



{\z\xr\^>eUi]} 



Consider Y.i:si+kr.<t |X (I)Tl^|2\i)-|2>ei [,]}■ We decompose it as 



E |x'(^)Ti{i 



E ix'(^)f(i 

,j.5i+fc„<j 



{\x\xy\^>eU^,m{nt=a} + l{|z'(i)'|2>ei[i],7vi(7-)t#o} 



On {|X (I)Y > [i], 5"+'=" < t} we have 



IC iiy\ > |x iiy\ - \x\iy\ - \x\iy\ > ^. 



1 / 2fc„rn| log b„ 



Pn 
by [ST] and (A.7). Hence by (3.1) we have

1 ^ - 

and thus (3.1) and (A. 7) imply that Ai^t ^ {Pn)~^~ J2i■si+^',^<t K (lYl^- Hence Lemma A. 3 and [SCI] yield 

E[Aij] < {bnKy'iknPn)-'^. (A.IO) 

On the other hand, since |/*(i)| < A:„f„ — > and is a point process, pathwise for sufficiently large n there exists 
a some index k{i) e N for each i such that D^{I)l = 7fe(i)7Vi(J')t. Hence by (A.l) we have \D^{I)l\ > B^^ on 
{|z\j)^P < gl[i\,N^{r)t ^ 0,3'+''" < t} for each i pathwise for sufficiently large n. Moreover, on {\z\iy\'^ < 
gi\i], \D^(I)l\ > B-\S'+''" < t} we have 

\c\iy\ > \D\i)i\ - \z\iy\ \x\iy\ - \x}im > b-' \x\i)\\ - \'x\i)\i 

hence by (A. 7) a.s. for sufficiently large n we have A2.t < Ei:S'+''n<i l^^(^)'i^^{|c'(i)i|>i/2S,Ari(7')t5^o}- Therefore 
Lemma A. 4 yields 

A2,t = Op ((6„fc„)"^(fc„p„)-'^^) . (A.ll) 

By (A.IO) and (A.ll) we obtain Ej:Si+fc„<t l^^(^)*l^l{|z'(i).|2>ei [,]} = Op (^(6„fc„)-i(fc„p„)~^^ , and by symme- 
try we also obtain Ei:fj+fc..<t ^^^^'^'>^^'^^{\z'{jy\^>glU]} = ((&nfcn)^^(fcn/0„)~^) ■ Consequently, by (2.5) we 
have supo<,,<i |I,| = Op (^(bl/^Pn^^ ' ^ . 

Next consider II. Since D^(iy = on {N^{r)t = 0}, we have 

Tj—rT2 ^\lYtZ (j')^^'^l{|zi(i).|2<gi[,],|z^(^).|2<g2[j]^jvi(/-)*#o}' 

and thus an argument similar to the proof of (A.11) yields $\sup_{0 \le s \le t} |\mathrm{II}_s| = o_p(b_n^{1/4})$. Similarly we can show
$\sup_{0 \le s \le t} |\mathrm{III}_s| = o_p(b_n^{1/4})$ and $\sup_{0 \le s \le t} |\mathrm{IV}_s| = o_p(b_n^{1/4})$. Consequently, we complete the proof of Theorem 3.1. □



B Proof of Theorem 3.4 

Exactly as in the previous section, we can use a localization procedure for the proof, which allows us to
systematically replace the conditions [A4], [A6] and [K_β] by the following strengthened versions:

[SA4] ^ V ^ < and (A.3) holds. 

[SA6] There exists a positive constant C such that b~^Hn{t) < C for every t. 

[SKfi] We have [K^] with ^ =: E and (A')', [M']', (A')' and [M']' {I = 1,2) are bounded. Moreover, there is 
a non-negative bounded measurable function $\psi$ on $E$ such that

$$\sup |\delta^l(\omega,t,x)| \le \psi(x) \quad \text{and} \quad \int \psi(x)^\beta F^l(dx) < \infty, \qquad l = 1,2.$$

Next, an argument similar to the one in the first part of Section 12 of [23] allows us to assume that < ^ < ^' < 1 
under [A2]. Furthermore, in the following we only consider sufficiently large n such that 

Kfn < hi-^/\ (B.l) 

Note that we can use Lemma 11.2 of [31] under [A2] and [SA4] in this situation. 
Now we prove some auxiliary results. Let 

N' l{|5,|>i} := k{5') * {^Y - v^). 

First we need the pre-averaged versions of some lemmas in Section 6 of [30]. For processes V and W , V •W denotes 
the integral (either stochastic or ordinary) of V with respect to W . 
Lemma B.1. Suppose [ST], [A2], [SA4] and [SK2] hold. Then for any $t > 0$ we have

00 00 

P (N\r), + 0, \L\T)\\^ > [z]) ^0, {N'(J')t + 0, \L\m^ > 4gl[j]) ^ (B.2) 

i=i j=i 

as n —> oo . 

Proof. Since and have no common jump, Ito's formula yields 

N^{r)\P{iy\^ = \L\iy\^_Pi . iv^ + 2N\r)^L\iy_ • L\iy + N\r)^ • [Z1(t)'], 

and thus wc obtain E[N^{P)t\L^{iyt\^] = E[\L^iy\lP_ • A]] + E[N^{P)^ • {L^{iy)t] by the optional sampling 
theorem, where is the compensator of 7V^. Since = l{|5-i|>i} * t/^ and {L^{iy) =■ X]p=7^ 5p (^-^^ • (-^^)) = 
Sp=T^ 5p ★ t^^, by [SK2], [A2] and the optional sampling theorem we have 



E[N\Py\L\iyf] < f E[\L\iyfll]ds+Y^ f E[N\P)Jl^P]ds 
Jo p^l Jo 

= f E[{L\Ty)Ji]ds+ fm'insr^^ds, 

Jo p^l Jo 

and thus again [SK2] and the representations of A^ and (L^(I)*) yield 

p=l Jo p^l Jo p^l L 

Since [SA4] implies J^Zi < rnt, we have 



i=i i=i 

and thus (3.1) yields the first equation of (B.2). Similarly we can prove the second equation of (B.2). □ 

Let (pp{e) = /{i/'<e} V'(^)^^'(da^) foi' each p £ [/3, 00). The following lemma is the same one as Lemma 6.7 

of [30], and will be useful to prove the lemmas below. 

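Lemma B.2. Suppose [SK_β] holds for some $\beta \in [0,2]$. Let $p$ be a positive number and $(\rho_n)$ be a sequence of positive
numbers which tends to 0. Then there exists a sequence of numbers $\varepsilon_n \in (0,1]$ such that

$$\limsup_{n \to \infty} \rho_n^{-p/2} \varepsilon_n^{p} \varphi_\beta(\varepsilon_n) \le 1 \tag{B.3}$$

and

$$\varphi_\beta(\varepsilon_n) \to 0, \qquad \sqrt{\rho_n}/\varepsilon_n \to 0 \tag{B.4}$$

as $n \to \infty$.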

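Proof. The strategy of the proof is the same as the one in the proof of Lemma 7.4 of [3]. Let

$$a'_n = \sup\{y \in (0,\infty) \mid y^p \varphi_\beta(y\sqrt{\rho_n}) \le 1\}, \qquad a_n = 1 \vee (a'_n - n^{-1}).$$

Since $\varphi_\beta(\varepsilon) \to 0$ as $\varepsilon \to 0$, for any $C > 0$ there exists a positive number $\varepsilon_0$ such that $\varepsilon < \varepsilon_0$ implies $\varphi_\beta(\varepsilon) < C^{-p}$.
Moreover, since $\rho_n \to 0$ as $n \to \infty$, there exists a positive integer $n_0$ such that $n \ge n_0$ implies $\rho_n < \varepsilon_0^2/C^2$, hence
$C^p \varphi_\beta(C\sqrt{\rho_n}) < 1$. Therefore we have $a'_n \to \infty$, hence $a_n \to \infty$. Furthermore, for sufficiently large $n$ we have $a_n < a'_n$,
hence $a_n^p \varphi_\beta(a_n\sqrt{\rho_n}) \le 1$. Therefore, if we put $\varepsilon_n := a_n\sqrt{\rho_n} \wedge 1$, we obtain (B.3) and $\sqrt{\rho_n}/\varepsilon_n \to 0$. Moreover, since
$\varphi_\beta(\varepsilon_n) \le \varphi_\beta(a_n\sqrt{\rho_n}) \le a_n^{-p} \to 0$, we complete the proof. □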
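The construction of $\varepsilon_n$ in the proof of Lemma B.2 can be traced numerically. The following is a minimal sketch under assumptions that are not taken from the paper: a hypothetical power-law form $\varphi_\beta(\varepsilon) = \varepsilon^{\gamma}$ with $\gamma = 0.5$ (for which $a'_n$ has a closed form) and the hypothetical choice $\rho_n = 1/n$. All function names are ours.

```python
import math

GAMMA = 0.5  # hypothetical decay exponent: phi_beta(eps) = eps**GAMMA (an assumption)

def phi_beta(eps):
    """Stand-in for phi_beta: nondecreasing, with phi_beta(eps) -> 0 as eps -> 0."""
    return eps ** GAMMA

def a_prime(rho, p):
    """sup{y > 0 : y**p * phi_beta(y*sqrt(rho)) <= 1}, closed form for the power law."""
    return rho ** (-GAMMA / (2.0 * (p + GAMMA)))

def epsilon_n(n, rho, p):
    """eps_n = min(a_n*sqrt(rho_n), 1) with a_n = max(1, a'_n - 1/n), as in the proof."""
    a_n = max(1.0, a_prime(rho, p) - 1.0 / n)
    return min(a_n * math.sqrt(rho), 1.0)

def diagnostics(n, p=2.0):
    """Return the quantity bounded in (B.3) and the two quantities vanishing in (B.4)."""
    rho = 1.0 / n  # hypothetical choice rho_n = 1/n
    eps = epsilon_n(n, rho, p)
    b3 = rho ** (-p / 2.0) * eps ** p * phi_beta(eps)
    return b3, phi_beta(eps), math.sqrt(rho) / eps

for n in (10, 10**3, 10**6):
    print(n, diagnostics(n))
```

In this toy setting the first diagnostic stays below 1 while the other two decrease with $n$, which mirrors (B.3) and (B.4); the rates themselves depend entirely on the assumed form of $\varphi_\beta$.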

We introduce some auxiliary notation. Let $(\varepsilon_n)$ be an auxiliary sequence of numbers in $(0,1]$ such that

limsup-^^-^ < oo, (B.5) 

n— fO 
and we set $E_n := \{x \in E \mid \psi(x) > \varepsilon_n\}$. We will specify the sequence $(\varepsilon_n)$ later. Furthermore, we put

for each Z = 1 , 2 and 
for each i, j G Z^. 

Lemma B.3. Suppose [SK_β] holds for some $\beta \in [0,2]$. Then for any $t > 0$ we have



(B.6) 



i=i 



Proof. Since ii^ 



E'l^'UOtJ = E^=i^^[A'(^')t] ^ e;:''E.=i^^ < ^n^knt, we obtain the first equation 

of (B.6). By symmetry we also obtain the second one. □

Lemma B.4. Suppose [SK_β] holds for some $\beta \in [0,2]$. Then for any $t > 0$ we have



E 



E 



Proof. Since E[\E.^^){im = (I)')*] < ^^{en)E [Ep=7 l^^+^OlJ and i?[|e(I)j|2] < 

we have 



(B.7) 

;2(^-l) + ^[|p(^)|2]^ 



E 



Since </32(£n) < (^n)^ ^'■Ppisn), {P — 1)+ < /5/2 and A:„r„ = o((e„)^) by (3.1) and (B.5), we obtain the first equation 
of (B.7). By symmetry we obtain the second equation of (B.7). □ 

Lemma B.5. Suppose [ST] and [SK_β] hold for some $\beta \in [0,2]$. Then for any $t > 0$ we have

oo oo 

E -'-{|£H5)JP>cei[il} = E -'-{lL2(j=)J|2>ce^2)j^.jj = Op(A:„p,//^). (B.8) 

i=i j=i 

Proof. Combining Lemma B.3 with Lemma B.4, we obtain 

oo oo oo 2 

2—1 ?.— 1 'i — 1 

hence Lemma B.2 with p = 1 yields the first equation of (B.8). By symmetry we also obtain the second one. □ 
Lemma B.6. Suppose [ST] and [SK_β] hold for some $\beta \in [0,2]$. Then for any $t > 0$ we have



Ei^'(^)*i'i 



{\L\I)i\^<cgUi]} 



1=1 

Proof. Since 



oo 

Op {knp'n-'/') , E \^'(^yt\'U\LHm^<c,im = Op {knpl-^/') . (B.9) 



2—1 2—1 2—1 

QO OO 



Lemma B.3 and Lemma B.4 yield Ei^i l^^(2^)tPl{|Li(i)5|2<cei [i]} = Op {{{en)'^fi3{£n) + Pn}kne~'^) , hence by 
Lemma B.2 with p = 2 we obtain the first equation of (B.9). By symmetry we also obtain the second equation of 
(B.9). □ 
Next we prove some lemmas which deal with the events that the noise part corrects the effect of small jumps.

Let $\eta_n := (k_n \rho_n)^{-1}$. Set



2 



p=l q=l 

for each i^j G Z^. 

Lemma B.7. Let $c_1$ and $c_2$ be two positive numbers. Suppose [ST], [SA4] and [SK_β] hold for some $\beta \in [0,2]$.
Suppose also that [SN^] holds for some $r \in (2, \infty)$. Then for any $t > 0$ we have

Proof. Consider (B.IO). We decompose the target quantity as 

E (^)*l^^{|Y'(i)'|2>cieiH,|Li(i)sp>c2eiH} = E 1^ (^)*l^-^{|Y'(i)'|2>ciej,W,|Li(i)iP>c2eJ,W,ivi(/')t7^o} 

j.5i + fc„<t i:S'+'="<t 

+ E 1^ (^)Tl{|xi(x).|2>cie},H,|Li(i)j|2>c2ei W,Afi(/')t=o} 

,;.5i + fc„ <t 

=: Ai^t +^2,t. 

By Lemma B.l we obtain Ai t = Op (^rjn^ p„ . On the other hand, on {|X > ciei[i]} we have 



if (1)^1 > ix^(i)'i - \xHm \x}{m >vj^J,f^~2K. 



Ko \ Pn J 

by [SA4], [ST] and (A.7). Hence by (3.1) we have 

"^2:* - _E 1^ ^^y\^^{\LHX)l\^>sUA,\C\l)'\^>c,p„/4Ko} 

l:S'+'="<t " j:S'+'="<t 

and thus (3.1) and (A.7) imply that Ai,t < (p™)"^ I]i:S'+'="<t K^(^)Tl{|Li(i)j|2>c2eiH}' Lemma A. 3 we 
obtain Eq [^2,*] < ?7„^ l{Li(i)j|2>ei [i]}- hence Lemma B.5 yield A2,t = Op {r]n^ Pn^^^^ ■ Consequently, 

we obtain (B.IO). By symmetry we also obtain (B.ll), and thus we complete the proof. □ 

Lemma B.8. Suppose [ST], [SA4] and [SK_β] hold for some $\beta \in [0,2]$. Suppose also that [SN^] holds for some
$r \in (2, \infty)$. Then for any $t > 0$ we have

E l-^^(^)tl-^{|z'(I)'|2<eJ,W,|Li(I)j|2>4ei [j]} = ij^nVn'^Pn'^'^^ 1 (B-12) 
j.Si+fc„<j 

E '■^^'-'^•**l"'"{|Z^j')J|2<e^b],|£2(J)JP>402[i]} ^ °P {^nVn'^Pn^^^^ ■ (B.13) 
j:f j + fe„ <t 

Proof. Consider (B.12). We decompose the target quantity as



)-fc„<t 

E (l{|c'(i)-p>ei [,;]/4.|z\i)'P<ei [,;],|Li(?)jp>4ei [,]} + ^{if ( 



|z^(i)'P<ei [i],|Li(i)jp>4ei[i]} 



/{|C^(i)-p>eiHM|z^(i)'P<eiH,|Li(2:)IP>4eiH} ' "{K'(i)>P<eiW/4,|z^(i)-|2<ei w,|Li(i)jp>4eJ.W} 

Bi^t + IB2,t- 
By the Schwarz inequality we have



< 



'{\C'{iy\^>eUi]/*.\L\i)i\^>4eim 

i:Si+fc,i<t 

11/2 |. N 1/2 

] l{|£imjP>4ei[i]} } 

I j.gi + fc„<t J 



[ST] and Lemma A.3 yield



E 



{\C{xr\^>eU^]/i} 

i:S'+'=n<t 



oo oo 



4=1 i=l 



Combining this with Lemma B.5, we obtain Bi t ~ { knVr/^ Pn ' ' ] . On the other hand, on {|Z < 



1^1 / 



gi[i], > 4^^l HI we have |Y {iy\ > \L\l)l\ - |Z {Tj\ > y/g^M- Moreover, by (A.7) we have 

\D\i)i\ > \Y\iy\ - \x\i)i\ - \c\iy\ - \x\iy\ > ^„ 



1 „ /2fc„r„|log&„ 



2%A/?o V Pn J 

on {|c\^)i^ < ft\H/4, |y\i)^|2 > < t}, and thus (3.1) yields \D\l)l\ > 0, hence 7Vi(r) ^ 0. 

Therefore, we obtain 

^2.t < E l^^(^)J|l{|Li(X)jP>4ei[i],Ari(/-)t5^0}' (^-14) 

and thus Lemma B.l yields M2,t ~ Op {kniln Pn^ ^ ■ Consequently, we obtain (B.12). By symmetry we also obtain 
(B.13), and thus we complete the proof. □ 

Lemma B.9. Suppose [ST], [SA4] and [SK2] hold. Suppose also that [SN^] holds for some $r \in (2, \infty)$. Then for any
$t > 0$ we have



E l'^^(^)tl^^{|z'(x)'P<e^[i],|Li(i)jp>4ei [i]} - (j^nVn'^'j , (B.15) 
E ^^'^^'^')^^^'^^{\z^{J)^^<ellj]^L'^J)l\^>4el[j]} ^ Op{knifJ^^ ■ (B.16) 



Proof. Consider (B.15). We decompose the target quantity as 



^{|z^(i)'|2<e,\W,|Li(i)jp>4e,\W} 



= E l-^^(^)tl^ (x)'P>e,\W/4,|z\i)-p<eiH,|Li(i)j|2>4e,\H} + l{lc'(2)'P<eiH/4,|z'(i)'P<eiW.lii(x)JP>4eiH}) 

=: ^ + r2,t. 

An argument similar to that in the proof of (B.14) yields T2^t — Op (jinVn^^^ ■ On the other hand. Lemma A. 3 yields 

00 00 
E[ri,t] < pVJ'^Y.^ [\L\i)\\'E,[\c\im] < v:/'Y.E[\r{t)\] < r^^/^Kt, 

i=l i=l 

hence we obtain (B.15). By symmetry we also obtain (B.16), and thus we complete the proof. □ 

Let K]-' = l{/i(t)njj(t)5^0} each i,j e Z+ and t e M+. 
Lemma B.IO. Suppose that [A2] and [SA4] are satisfied. Let t he a G^^^ -stopping time. Then HKJ^^ is J^gi^fj- 

(n) 

measurable for any i,j £ Z+, provided that H is a Qj^^ .ymeasurahle random variable. 
Proof. Let B = {P{t) n ,P (t) 7^ 0}. It is sufficient to show that AnBnC G J^u for any u e M+, where A e tj^"'^.

and C = {5* A f ^' < u}. On S wc have li^iij) - A < |r| V | V (5'*+'^" - f V {tJ+^^ - 5*) < fc„f„, hence 
R^{i,j) = {ii^'li, j)-S^'Afj} + S^'Afj < S'hTi + kr,fn, and thus we have B n C = BnCn{i?''(i, j) < u + A:„f„}. 



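Since $A, B \in \mathcal{G}^{(n)}_{R^\vee(i,j)}$, we have $A \cap B \cap \{R^\vee(i,j) \le u + k_n \bar{r}_n\} \in \mathcal{G}^{(n)}_{u + k_n \bar{r}_n}$; however, $\mathcal{G}^{(n)}_{u + k_n \bar{r}_n} \subset \mathcal{F}_u$
by (B.1). This together with the fact that $C \in \mathcal{F}_u$ implies $A \cap B \cap C \in \mathcal{F}_u$. □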



Lemma B.11. Suppose [SC1]-[SC2], [A2], [SA4], [SA6], [SK2] and [SN^] hold. Then for any $t > 0$, there exists a
positive constant $K$ independent of both $n$ and $(\varepsilon_n)$ such that



E 



sup 

0<s<t 



sup 

0<s<t 



E 



sup 

0<s<t 



i.j-.R}' {i,j)<s 
%3:B^ (i,j)<s 



< Kkl^/^^)b]/\ 



Proof. First consider (B.17). Since integration by parts and Lemma 4.3 of [31] yield
we can decompose the target quantity as 



(B.17) 
(B.18) 
(B.19) 

(B.20) 



iJ-.R'^ (i,j)<s 



i.j=Q 

--: Ai,, + A2,,, + A3,,. 



J2 ^(^rsM\j)iK^ 

i,j:R^{i,j}>s 



First we estimate Ai,,.. Since = Y/p'U Qp^'^ {I'^^') s and ^^{JY, = T/q'Li gqM'^{J'^'')s: we have 

00 i-\-kn — l jA-kn — l 00 

= E E E 5;-.5^,^l^Si(/'+'')-./^+'-A4'= E "(^''9)-^ 
where ^b,?),, = Er(p-fc„+i)+ E|=(Vfc„+i)+ 9p-^9q^]KlK Hence we have 



(Ai,.). = E 
9=1 



E"(p,9)-si(/p)_ 
.p=i 



and thus we obtain 

QC 

i?[(Ai,),]<E / E 









[1 







00 pf 00 

d- = E / E^K''^)'^^^[i-'(^')^H-^S-. 
— 1 1' — 1 



g=l"'0 



ds 



E^b,'7).Si(^)J jq 

q=l I l,p=l J 

00 p— 1 g— 1 

<fc>2(£„) E E E i?[Af IFWIk^Wl] <fcn¥'2(£n)fc„6„<fc>2(£„)^^y' 

p,9=l i=(p_fc„+l)^ j=(q-k„+l) + 

by the representation of {AP) and (S-^), [A2], (A.4), [SA6] and (2.5). Combining this with the Schwarz and Doob
inequalities, we conclude that



E 



sup \Ai,s 

0<s<t 



< 



kl4^^)h]l\ 



(B.21) 
Similarly we can also show that E [supq<^<j |A2.s|] < k'^^^ Lp2{£n)bll^ ■ Now we estimate As^^. (B.20), the Doob
inequality, [A2] and the optional sampling theorem, (A. 4) and [SA6] imply that 



E 



sup £ \^\T)\M\J)IK^^\ 



0<s<t 



i,j=0 



< 



J2 K''\pmj'it)\ 



< 



(B.22) 



On the other hand, (A. 4) also implies 



J2 < (2fcn + 1) 

ij:i?^(ij)>s 



E 1+ E 1 



< (2fc„ + l)kn. 



(B.23) 



Therefore, the Schwarz inequality and (2.5) yield E [supq<^<( lAa^gj] < \/ ifi{£n)kn • Ki ^ k'^\/^2i£n)bl/^ ■ Consequently,
we conclude that (B.17) holds. (B.18) can be shown in a similar manner.

Finally consider (B.19). Define the process 3t by 3t = V^X^jli Cf, l{f 3<t}- Then obviously 3t is a purely 

discontinuous locally square-integrable martingale 6'^°-' and {jy = '}p'{JY on < t\. On the other hand, 

since 'E} is quasi-left continuous by Theorem 1-4.2 of [27] and for every j T'^ is F^°)-predictable time by [A2], we 
have ASi,^ = for every j. Therefore, we have [S^,3^] = 0, and thus we can decompose the target quantity as 

i,3:R'^ (i,j)<s 



i,j:R'^ (i,j)>s 



ij=0 

=■■ Ti,, + T2,3 + T3,. 
due to integration by parts and Lemma 4.3 of [31]. First we estimate Ti,^. Since 

oo kn~l kji — l 

Tm=6-^/^ E E Y a-pMaY^K'-^^'ii-'-n-Ji-'' '51 

i,j—0 p—1 q—0 

oo i-\-kn — 1 j~\~kn — 1 OO OO 

=^«'^' E E E 9;-Ai9r,-,K'iE\r+n-j'-''' = b-^^^ 

i.j—0 p— 1+1 q—j p—1 q—0 

where v'{p, g), = T^Zl-.^+^u ^U.-^.+m g^-AiaY^-.Kl^ ■ We have 

2 



[Ti..]. = E 

g=0 



.P=i 



hence we obtain 



g=0 



OO oo 



q=0 p=l 



oo oo P^l 



< 



(=«)EE E E E Ky\P{r)\ <^,{e,,)kl<klv,{e,,)hl 



1/2 



9=0 p=l i={p-k„ + l)+ j=(q-k„+l) + 



by the optional sampling theorem, the representation of (^^), Lemma B.IO, the Lipschitz continuity of g, (A. 4) 
and (2.5). Since (Ti,.) is the predictable compensator of [Ti_.], the above result and the Schwarz and Doob 
inequalities yield E [supo<s<( ITi^^l] ^ kf^y' ip2(£n)b\!^ ■ On the other hand, we can show that E [supQ<g<j |'^"2,s|] ^ 
^n^f V^i^njihl'^ in a similar manner to the proof of (B.21). Finally, since 



E 



sup \^\irsC{jysK^y\ 



0<s<t 



i,j=0 



OO 






OO 


^E^ 

2 = 


sup \E\iyf 

0<s<t 


< ^2{£n)E 


E i^'^wi 



< 



<y52(£n)fc»i 



by (A. 4) and the Doob inequality, the Schwarz inequality and (B.23) yield E [supQ<;^<j 1^3,^1] < k'^^J if2{£n)bn^ ■ 
Consequently, we obtain (B.19). □ 
Lemma B.12. Suppose [ST], [A2], [SA4] and [SK_β] hold for some $\beta \in [0,2]$. Suppose also that [SN^] holds for
some $r \in (2, \infty)$. Then for any $t > 0$ we have



K sup 

0<s<t 



K sup 

0<s<t 



K sup 

0<s<t 



t.J:R^{l,j)<s 
i,j:By(l,])<s 



(B.24) 
(B.25) 
(B.26) 



Proof. Define the process by = Eij:flv(»j)<s Q^{X)\M'^{J)IK'^ . If $\beta \ge 1$, the Schwarz inequality, [SK_β]
and [SA4] yield



E 



sup \Ts 

,0<s<t 



< E 



<<E 



hj=i- 

1/2 



E 



1/2 



= o {{Kr^f'^p-^p-^y^) 



hence we obtain supo<s<t [T,! = Op ypli ^^^j . If /3 < 1, we decompose the target quantity as 



i,j:R^ {i,j)<s 



where ~ k((5^)*i^^ and 0^ = k((5^)1(£;i)c*j/^. Then, by [SAG] and an argument similar to the above, we can show 
supo<,<t |T2,,| = Op (fc2pii"^'/2^„6y') = Op{klbU^). On the other hand, note that < Sn''^-'^^ knfn < 

i/fc„f„| log bn\, an argument similar to the proof of Lemma 6.2 of [31] yields 



sup 

0<s<t 



hence Lemma 12.1 of [31] yields supp^^^^ [TTi^sl = Op (j^n^l/^^ since we have the conditions [A2], [K^](v) and [SA6]. 
Consequently, we conclude that (B.24) holds. Similarly we can also show (B.25) and (B.26). □

Proof of Theorem 3.4. By a localization procedure, we may replace the conditions [K^], [C1]-[C2] and [N^] with
[SF], [SC1]-[SC2] and [SN^] respectively. Moreover, we can also replace the condition [T] with [ST] by Lemma A.1,
while (2.3) can be replaced with (A.3) due to the above argument.
We decompose the target quantity as 



1 



U.j:R^{l.j)<t 



Y^(I)-|2<4e„W,|Y"(J)jp<4e„[i]} 



1, 



X 



\irx\jy] 



iJ:R^ {i,j)<t 



l,j:R-'{i,j)<t 



(a) By Theorem 3.1, we have 



sup 

0<s<t 



|I.| =Op(6y^) + Op (r^:"). 



(B.27) 
(B.28) 
(b) Next consider II. We decompose it as
1 



Y {lyy {jyK'^ {}'{\7.\iY?<Q\[iui\jv\^<eim}^{ 



i,j:Ry (i,j)<t 

— 1 -1 - -2 - 1 



^{|z^(i)'P>ei W}u{|z'(^)jp>e2[j]}-{Y'(i)>p<4ej,H,Y'(JMP< 
First estimate Ili^f. We decompose it as 



111,* 



^ (-^{|Y<''(i)-p>4eL''W,|Y<''(j')jp<4e<?'[j]} + -^{|Y<''(i)-p<4e<."W,|Y<"(j')jp>4ek"[j]} 
+ -^{|Y<''(i)'|2>4e<.'>W,|Y<"(j')jp>4e<f'M} 

=:II^^] +l4^f^ 

Consider 11^^^^ The Schwarz inequality, (A.4), [SCI] and (2.5) yield 

sup |n«| 

0<s<t 



11/2 J- >. 1/2 

< J2 1^ ('^)'l^l{lY^(JPP<4e2[j],|z2(j)j|2<p2y,} > 

!>. 1/2 
J2 1^ (^)'l^l{|zi(i)'p<ei^W,lY\i)'p>4ei W} \ 
i:S'+''n<t ) 

<Pr/M X] |Y l{|zl(I).|2<el[i],|Yl(X)>P>4eiW} [ ■ 

li:Si + fcn<t I 



On {|Z < ft\ [z], |Y (I)'f > 4£.,\ HI we have \L^{I)\\ > |Y - |Z > hence we obtain 



and thus Lemma B.7 yields



{|Li(i)jP>ei W,|Y^(i)'P>4eiW}' 



0=o,{v^pI/'-^/'). (B.29) 
By symmetry we obtain 

sup \ll^;'],\ = oJr,^pl/'-^/*). (B.30) 

0<s<t ^ ^ 

On the other hand, the Schwarz inequality and (A.4) yield 

sup|nSl<-J— / iyVtvi2i _, . ] 

a<a<t IpuY'^'i 



sup I 

0<s<t 



X 



N 1/2 

E 1"^ ('^)''l^^{|Y"(J)J|2>4e2[jj,|z=(j)J|2<e2[j]} > 



hence by the same arguments as the above we obtain supo<g<t = Op ffc^ ^rjn^ Pn^^'^^ , and thus /3 < 2, (2.5) 



and (3.1) yield 



sup I 

0<s<t 



4'il=0p(r/„'"). (B.31) 
(B.29), (B.30) and (B.31) yield

sup IIIi,,! = Op (rj^) + Op (pl-^/'') . (B.32) 

Next we estimate $\mathrm{II}_{2,t}$. By (A.4), we obtain

sup llla.tl 

0<s<t 

^ ^{|zi(i)'|2>ei^[i],|Yi{i)'|2<4eJ,[i]} + X] ^{|z"(J)Jt^>e|[jl.|Y"(j')J|2<402[j]} r 

l,i:S* + ''n<t j:fj + '=n<t J 



+ "'"{|z'(2;)'P>eJ.W,|Y\i)'P<4ei[i),|c'{x)'l^<eiW/4} + X] -"-{Iz^jjJ p>e2 [j]^|Y2(j)i |2<4g2 y]_|<.2(j,jj |2^^2 j^^^^j 

[SC1] and Lemma A.3 yield

51 ^{|c'(i)*P>eJ.W/4} ^ ^w^*^"^^' X! ^{if (J')jp>eJ[il/4} ^ (B.33) 

On the other hand, on {\y\lY\^ < Ag^n^\il iti^Tl^ < 3'+''" < t} we have 

\D\I)1\ < \Y\iy\ + \X\I)1\ + \c\iy\ + \X\m < 57^/2 + 2AV2fc„f„| log&„| -> 0, 

hence a.s. for sufficiently large n we have D^{X)\ ~ 0. Moreover, on {|z\l)*p > g\[i], < £'^[i]/4, D^{I)\ = 

0, 5*+'=" < t}, by (A. 7) we have 



hence (3.1) yields \L\Xy \ > ^p„/9Ko. Therefore we obtain

oo 

^{\z\xy\^>ei[i],\Y\xy\^<4gi[z].\c\xy\^<gUi]/4} - l{|£i(I)jP>p„/9ifo}- 
By symmetry we also obtain 

oo 

Combining these results with Lemma B.5 and (2.5), we obtain 



sup 

0<s<t 



|Il2,t I = Op (r,:" ) + Op [pI-^/') . (B.34) 
By (B.32) and (B.34), we conclude 

|n,| = Op [v^) + Op [pI-^/^) . (B.35) 



sup 

0<s<t 



(c) Next consider III. We decompose it as 



Li(I)j|2>4ei[i],|L2(_7)J[2>4g2[j]} -r ^{|Li |2>4ei W,|L2( j)J |2<4p2 
First we estimate $\mathrm{III}_{1,t}$. The Schwarz inequality and (A.4) yield



sup I nil, si 
0<s<t 



{|z^(i)'|2<ei[i],|Li(i)j|2>4ei[i]} C ^ I' > ' '-{\z\j)j\2<Bl[j],\L^{j)l\2>4,ellj]} 



On {|z'(i)^f < ^2 [j], \LHJ)1\^ > 4gl[j]} we have \Y\jy\ > \L^{J)l\ - |z'(J>| > ^/^], hence Lemma B.7 
and Lemma B.9 imply that 

|IIIi,| = o, {k-'lS'^p'JI') ^ o, (rfl^pT"'^') ■ (B.36) 

Next we estimate IIIz,*. On {|z'(^)J|2 < Ql{j\ \L^iJ)l\^ < igUj]} we have |y'(^)J| < |z'(j')J| + \L^{J)i\ < 
hence by (A. 4) we obtain 



sup 

0<s<t 



^sup Jin2..| < ^ 5] \L\iy\i^ 

i:S^+''"<t 



and thus Lemma B.8 yields 



sup \llh.s\=o,U/^pl/'-^/^). (B.37) 

0<s<t ^ ' 



Next we estimate $\mathrm{III}_{3,t}$. By the Schwarz inequality and (A.4) we have

1/2 f N 1/2 



{|z"(J)il2<e2[j],[£2(i)j|2>4g2[j]} 



Note that on {\r {jy\^ < gl[j],\LHJYt\^ > 4gl[j]} we have \Y\jy\ > \L^jy,\ - |Z^(^p| > ^^], Lemma 
B.4, Lemma B.7 and A;„ ^ = o(p„) yield 

sup 11113..! = Op {kyyl''pT~^l''nf\ = oj, U-^/2r7„^) . (B.38) 

Finally we estimate $\mathrm{III}_{4,t}$. First we specify $(\varepsilon_n)$. By Lemma B.2 we can choose the sequence $(\varepsilon_n)$ satisfying
(B.3) for $p = 2$ and (B.4). Next we decompose the target quantity as

™4,t =-^— 51 (^)^^'^i{|z^(i)»p<ei[»],|z'(^P|2<e2[j]j£i(x)j|2<4gi^^ 

X fl/ 



=:ii4\' +ffli^t' +inf]. 



By Lemma B.l, we obtain supo<3<i = Op(6y''). Moreover, on {|Z^(J)J|2 < g2 [j]^ |£2(jp|2 < 4^^ [j]} we 

have \Y^ijy \ < \L'^ijy\ + \i^ijy\ < and thus by (A.4), Lemma B.3 and (B.4) we have 

OO 

sup |mi']| < fc-VnE i{^M/')^o} = Op [ply^'/') . 

0<s<t j^j^ 

On the other hand, since 



we can decompose it as 



1 



^ ^{|Z^I)'|2<eiH,|z'(JM|2<e2Jj-]j£l(I)j|2<4gl[,]j£2(J)^-|2< 



^ L\iy [{A\jy +A\jy} + M\jy + M\jy + c\jy] 



«j:-R^(*j)<t 

^ l{JV2(Jj)t = 7Vl(/')t=0,|Z\l)'|2<el[j]Jz"(J)J|2<e2y]J£l(I)j|2<4gl[,;]^ 

=: Ti t + r2 f + Fs t + r4 f. 
First consider $\Gamma_{1,t}$. By the Schwarz inequality and (A.4), we have



1/2 f ^ 1/2 

J|2 



0<s<t 



hence Lemma B.6, the boundedness of (A^)' and (^4^)', the Lipschitz continuity of g and [SAG] yield 

|ri,| = o, {k-y^pl/'-^/') ^ o, (bl/^) . (B.39) 



sup 

0<s<t 



Next consider Tz,*. Since 2,^(1)1 = ^^(I)^ + 6^(1)^ on {N'^{J3)t = 0}, we can decompose the target 

quantity as 

„. p(i) I r(2) 

— ■ ^ 2,t "T ^ 2,t ■ 

First estimate r2^j. We decompose it further as 

l,j:i?v(jj)<t 



^ ^{w^(J^)t7^o}u{JV2(j^),5^o}u{|z'(i)M^>eJ.W}u{|Li(i)jP>4eJ.H}u{|z'(j)^p^ 

— • ^ 2,t ^ 2,t • 

By Lemma B.ll we have supg<5<j |r^^]'| = Opibll'^). On the other hand, (B.22), the Schwarz inequality, (A.4), 
Lemma B.3, Lemma B.5, (B.33) and (B.4) yield 



sup 

0<s<t 



Since we have 

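$$\varphi_2(\varepsilon_n) \le \varepsilon_n^{2-\beta} \varphi_\beta(\varepsilon_n) \lesssim \rho_n \varepsilon_n^{-\beta} = o\left(\rho_n^{1-\beta/2}\right) \tag{B.40}$$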

due to (B.3) for p = 2 and (B.4), note that fo„ = o{p'^), we obtain supo<s<t [Fj^]"! = Op (^blJ^ pl/^~^^^^ + 
Op (^pli^ — Op (^pli ^^'^^ + Op (vn^^ ■ Consequently, we conclude that 

^sup ^ IF W I = Op {bl/') + Op [pi-'/') + Op . (B.41) 

Next estimate r2 I . We decompose it further as 

^ " ^{N^j^)tmu{N^J^)tmu{\z\x)^'^>eUmu{\LHx)i\^>4:eUmu{\z^ 

^ 2,t ' ^ 2,t ■ 

Lemma B.12 yields supo<^<t |F^^]'| = Op (bl/'^^ + Op (^pi'^^^Y On the other hand, since Q^{T)\M^{J){kI^ = 
K'%^(iy_ • M'^iJYt + K'lM'^{J)t • e^(I)j by integration by parts and Lemma 4.3 of [31], we have 



E 



< k^f £-2(^-1)+ K 
by [A2], the optional sampling theorem and the inequality \e^(lP)t\'^ < e„^'-'^ and (A.7). Therefore,

(A.4) and [SA6] yield 



E 



J2 \e\i)lM'iJ)lK'''\' 

i,j:R^ {i,3)<t 



< ^~2(P~1)+, < ,2- -2(/3-l) + 



and thus an argument similar to the above yields 



Q<s<t 



Consequently, we conclude that supo<s<j iFj^^j — Op + Op (''"^^) (f"" ^^^) ■ Combining this result with 

(B.41), we conclude 

(B.42) 



sup 

0<s<t 



Similarly we can also show that 



^s^up^ |r3,.| = Op (bi/^) + Op {rj:p) + Op 



(B.43) 



Now we deal with r4_t. Since L^{i)l — £^(i)t on {N^{J^)t — 0}, we can decompose the target quantity as 



4,t 



X 1 - 1 



J2 2}{x)k\jYk^' 



{Ar2(jj),^o}u{iv2(jj)t#o}u{|z'(i)M^>dW}u{|£i(x)jP>4eiH}u{|z'(j^)^p>e?ji]}u{|^ 

-"-4.* I 4,t ■ 

By Lemma B.ll and Lemma B.12 we have supg<j,<;( IF^^]] = Op(bl/^) + Op{p\ ^^'^). On the other hand, by the 
Lipschitz continuity of .g, (A.4) and [SNj], we have 



Eq 



J2 r^Hm'ic ijyi'K' 

iJ-.R-^ {i,j)<t 



oo 



i\2 
t\ J 



hence the Schwarz inequality, (A.4), Lemma B.3, Lemma B.5, (B.33) and (B.4) yield 



-Op (k-^el-PI^MeuY'^kl/^p-J") + Op {k-\\-^l^^,{e^Yl%l^l\^l') 
Since we have (B.40) due to (B.3) for p = 2 and (B.4), note that 5„ = o{Pn)^ '^'^ obtain 



sup 

0<s<t 



sup 

0<s<t 



iri^ll -Op {h\l'pT-'^') + o, [pT-'^ST) = o, {pI'/') + Op (^;/^) . 



Consequently, we obtain 



sup 

0<s<t 



\U,\-Op[bl,^')+op{pl'^/'-)+op{n:^') 



By (B.39), (B.42) and (B.43), we obtain supo<,<4 Op ( bi/" ] +Op{p 



we obtain 



1/4 



1-/9/2 



sup I 

0<s<t 



hM = op{bi/')+op[pi-^^')+op[^:/'). 

Since $\eta_n = o(1)$ and $\beta \ge 0$, (B.36), (B.37), (B.38) and (B.45) yield

^sup^ |III,| = Op {bl/') + Op [rj:/') + Op . 



(B.44) 

(jlr/^^ ■ Consequently, 
(B.45) 

(B.46) 
(d) By symmetry, we obtain

^sup ^ |IV, I = o, (bl/^) + o, (rfj^) + Op [pi-^/') . (B.47) 

(e) Finally we consider V. We decompose it as 



iJ-'R"^ {i,j)<t 



^ [^{LHI)ir->40i^li],L^J)l\^>4eim + ^{LHi)l\2>4el,[i].L^J)l\^<4gl[j]} 

+ ^{LHi)i\^<40ili],L^jy,\-^>4eim + ^{LHi)i\^<4sUi],L^iJyj'<4ellj]} 

,t+V2,t+V3,t+V4,t. 



By the Schwarz inequality, (A. 4) and Lemma B.9 we have supp^^^^ \^i,s\ = Op yln j ■ Moreover, by (A. 4) and 
Lemma B.8 we have supq<^<j |V2,s| = Op ^^J/^ (hi"^'^^^^ and supq<^<j \^i,,s\ ~ Op {^ifj^ (hi'^~^^'^^ ■ Furthermore, 

1-/3/2' 



the Schwarz inequality, (A. 4) and Lemma B.6 yield supo<s<t \^ 4,s \ = Op ypn j ■ Consequently, we obtain 

sup I I = Op + Op . (B.48) 

Since $\eta_n = o(1)$, (B.27), (B.28), (B.35), (B.46), (B.47) and (B.48) yield

^sup^ \PTHY{Z\ Z'): PHY{X\Xy:\ = Op (bi/^) + Op [rj^) + Op (p^^/^) . 

Since $\eta_n = (k_n \rho_n)^{-1} = O(b_n^{1/2} \rho_n^{-1})$, we complete the proof of Theorem 3.4. □



C Proof of Proposition 4.1 

By a localization procedure, we may assume that [SC1]-[SC2], [SA4], [SA6] and [SN2] hold. In a similar manner 
we may also assume that = E and that there is a non-negative bounded measurable function $\psi$ on $E$

such that 



sup \6'{uj,t,x)\ < ipix) and / F' (dx) < 00, 1^1,2. 

Under the above assumption we can define the process L" by L" (5' ★ (/i' - ly^) for each ^ = 1, 2. Then, for 
each I = 1 , 2 we have = + L", where X," = - K'{6^{s,x))dsF{dx). Hence we can decompose the 
target quantity as 



sj:-R^(*j)<t 

=:It +nt +IIIt +]IVt, 



where x'\l)« = Y^;;-^ , (£) (x'i - X- ^^_,) , x"(i)^ = Y^^^ g (^) (x- - X- ^,_,) and X^ = + 
C/i , X|^. = + C/|^. for every i, j. 

First, an argument similar to the proof of Lemma B.11 shows that $\mathrm{II}_t = O_p(b_n^{1/4})$ and
$\mathrm{III}_t = O_p(b_n^{1/4})$. Next, combining Lemma 4.2 of [31] with an argument similar to the proof of Lemma B.11, we can
show that $\mathrm{I}_t = [X'^1, X'^2]_t + O_p(b_n^{1/4})$. Finally we consider $\mathrm{IV}_t$. By integration by parts we can decompose it as
By an argument similar to the proof of Lemma B.11 we can show that $\mathrm{IV}^{(1)}_t = O_p(b_n^{1/4})$ and $\mathrm{IV}^{(2)}_t = O_p(b_n^{1/4})$. On
the other hand, since



/(2) 



p,q—0 p—i q—i 



we obtain 



(3) 



^ CO I p q 

Thiv X! X! X! 



p,g=l \ i=(p-fc„ + l)Vl j=(q-A:„ + l)Vl 



Moreover, on {/p n J« 7^ 0} we have 5*^"^ < f « and f < S'p , hence for i G {(p - fc„ + 1) V 1, . . . ,p - 1} and 
j e {{q - kn + 1) V 1,. .. ,q - 1} we have < fj+'="-i and < S'+''"-\ so that 1?*^ = 1. Therefore, for 

p,q>k„ we have EL(p-fe„+i)vi E,'=(,-;..„+i)vi 5^-.5,"-,^t' = (^-=1"' 5?)' on {/p n > ^ 0} because 5(0) = 0. 

Since iP_Jl). [L'\L% = l^,-,riJ./0}(^-^-) ' W\L% and (^^^\^). (E-^i"' 5?) ' = 1 + O(fc-i) by the Lipschitz 
continuity of wc obtain 

V 2 

00 /kn — 1 



p,g— 1 \ i—l 



1 

(ibnvk V ^ I ^ 



E 5p"-.5,%i^*^^ J:^) • [L'\L'% + Op (fc-i) 



ivp) + Op (fc-i) 



Since we have 



^(3) _ ivp) 



(ipHYkn) 
1 

(ipHYkn) 



1 °° ^ 

7fc?i2 E E 



9p^i9q-jJ^t -L{i?v(j,j)>t} 

p,g=l Yi=(p-fc„+l)Vl j=(g-fc„+l)Vl 
i,j—0 



(Fj!).[i'\i"]*+Op(&y^) 



we obtain 



;(3) „ ivp) 



^ ^Er^^o ^t^l{iiV(i,j)>t}(^- and thus the Kunita-Watanabe 



inequality and the inequality of arithmetic and geometric means yield 



^(3) _ 1^(3) 



^ 00 



Since Kl'' l^j^v (^^ j-^-^tj is J-'^i^^^ -measurable by Lemma B.IO, we obtain 



E 



00 00 



i,j=0 



hence by [SK2], [SA4] and (B.23) we conclude that 



we obtain IVp' = [L'^,L'^]t + Op{bn^), and thus we complete the proof of the proposition because 
[X'\X'^] + [L'\L% 



,1/4 



^(3) _ jvp) 



Op(fc„r„) + Op(6y^) = Op(6,V'*). Consequently, 



□ 
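The step "by the Lipschitz continuity of g" in the proof above rests on a Riemann-sum approximation: the squared sum of the weights $g(p/k_n)$, normalized by $(\psi_{HY} k_n)^2$, equals $1 + O(k_n^{-1})$. The sketch below checks this numerically under two assumptions that are ours rather than the paper's: the weight function is taken to be $g(x) = \min(x, 1-x)$, a common choice in the pre-averaging literature, and $\psi_{HY}$ is taken to be $\int_0^1 g(x)\,dx = 1/4$.

```python
def g(x):
    """Hypothetical weight function on [0, 1] with g(0) = g(1) = 0 (an assumption)."""
    return min(x, 1.0 - x)

PSI_HY = 0.25  # integral of g over [0, 1] for this particular g

def normalized_weight_sum(k):
    """(sum_{p=1}^{k-1} g(p/k))^2 / (PSI_HY * k)^2, expected to be 1 + O(1/k)."""
    s = sum(g(p / k) for p in range(1, k))
    return (s / (PSI_HY * k)) ** 2

for k in (5, 51, 501):
    r = normalized_weight_sum(k)
    # k * |r - 1| staying bounded is the O(1/k) claim in this example
    print(k, r, k * abs(r - 1.0))
```

For this particular $g$ the error is in fact of order $k_n^{-2}$; a general Lipschitz $g$ only guarantees order $k_n^{-1}$.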



D Proof of Proposition 4.1 

By a localization procedure, we may systematically replace the conditions [C1]-[C2], [A4], [A6], [Nj], [T] and
[N^] with [SC1]-[SC2], [SA4], [SA6], [SN^], [ST] and [SN^] respectively.
Set = J2T=u^v "^^ch w e Z+ and / = 1,2. We define the random variables (^f and C| by (^f =
El=o ^i+iCs-" and C| = ELo ^«+iCt3-" ^r every Then we have 

i i i i 

u— li— 1 u— li— 

hence we obtain 

i 

J2 A«C^.-" = AoCs. - (C- - C^l). (D.l) 

In time series analysis, this relation is known as the Beveridge-Nelson decomposition. See [9] for details.
Combining (D.l) with Abel's partial summation formula, we obtain 

k„-l / i+p \ fc„-l 

p=0 \u=0 I p=0 

fe„-l 

p=0 

for every i. Similarly we can deduce 

g=0 \ii=0 / g=0 

)_ A:„-l 

9=0 

for every where = for each w e Z+ and ^ = 1,2. Note that Y.°^=\ I^Ll < E^i l-^Ll = 

^^l^'J < and ^^^1 < Y.°^^\ Y.T=u = E^i ^Im'J < oo for Z = 1, 2, by using the above formulas we 
can show that 

PHYiZ\Zy; ~PHY{Z\P)l' -^P and pfm^{Z\Z^)'i^ - RfirY{Z\P)'i^ 

as 71 oo for any t e M+, where Z^. = Z^^+XlCl., + fl^bn^^^ {X}g, -^s-O and Z^, = Z^^+X^C^^ ^^^(^t^ - 

^T3-i) fo'^ each i, J. Consequently, we complete the proof of the proposition due to Proposition 4.1 and Corollary 
3.5. □ 
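The Beveridge-Nelson relation (D.1) is purely algebraic, so it can be checked numerically. The sketch below is an illustration under our own assumptions: a finite filter ($\lambda_u = 0$ for $u > q$) with arbitrary coefficients, i.i.d. Gaussian innovations, and the classical form of the identity, $\sum_u \lambda_u \zeta_{i-u} = \Lambda_0 \zeta_i - (\tilde\zeta_i - \tilde\zeta_{i-1})$ with $\Lambda_v = \sum_{u \ge v} \lambda_u$ and $\tilde\zeta_i = \sum_u \Lambda_{u+1} \zeta_{i-u}$. The names `bn_sides`, `lam` and `zeta` are hypothetical.

```python
import random

def bn_sides(lam, zeta, i):
    """Return both sides of the Beveridge-Nelson identity at index i (needs i > q)."""
    q = len(lam) - 1
    # Tail sums Lambda_v = sum_{u >= v} lam[u]; the last entry Lambda_{q+1} is 0.
    Lam = [sum(lam[v:]) for v in range(q + 2)]

    def zeta_tilde(k):
        # zeta_tilde_k = sum_{u >= 0} Lambda_{u+1} * zeta_{k-u}
        return sum(Lam[u + 1] * zeta[k - u] for u in range(q + 1))

    lhs = sum(lam[u] * zeta[i - u] for u in range(q + 1))
    rhs = Lam[0] * zeta[i] - (zeta_tilde(i) - zeta_tilde(i - 1))
    return lhs, rhs

random.seed(0)
lam = [1.0, 0.5, -0.3, 0.1]  # hypothetical filter coefficients
zeta = [random.gauss(0.0, 1.0) for _ in range(50)]
for i in (10, 25, 49):
    print(i, bn_sides(lam, zeta, i))
```

The two returned values agree up to floating-point rounding for every admissible index, whatever coefficients are chosen.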


